Mercury Learning and Information, 2022. — 292 p. — ISBN 168392973X.
This book introduces an assortment of powerful
command line utilities that can be combined to create simple, yet powerful shell scripts for processing datasets. The code samples and scripts use the
bash shell, and typically involve small datasets so you can focus on understanding the features of
grep, sed, and awk. Companion files with code are
available for downloading from the publisher.
FeaturesProvides the reader with powerful command line utilities that can be combined to create simple yet powerful shell scripts for processing datasets.
Contains a variety of code fragments and shell scripts for data scientists, data analysts, and those who want shell-based solutions to “clean” various types of datasets.
Companion files with code available for downloading with Amazon proof of purchase by writing to the publisher.
Introduction to Unix.
Files and Directories.
Useful Commands.
Conditional Logic and Loops.
Processing Datasets with grep and sed.
Processing Datasets with awk.
Processing Datasets (Pandas).
NoSQL, SQLite, and Python.
True PDF