extract.py¶

Script for extracting subsets of tabular data.

usage: extract.py [-h] [-o OUTFILE] [-c COLUMN] [-v VALUE [VALUE ...]] INFILE

If no outfile is specified, outputs plaintext to stdout. If no column is specified, outputs filetype converted input. If no value is specified, outputs table indexed with given column (required). If value and column are specified, outputs subtable indexed with given column and containing only rows equal to given value(s).

Tested supported input filetypes: .csv, .geojson, .shp, .xlsx, .zip

Tested supported output filetypes: .bz2, .csv, .geojson, .gpkg, .gzip, .html, .json, .md, .pkl, .tex, .xlsx, .zip. All other extensions will contain output in plaintext.

Positional arguments:

INFILE                name/path of input file of tabular data to read

Optional arguments:

-h, --help            show this help message and exit
-o OUTFILE, --output OUTFILE
                        name/path of output file for writing
-c COLUMN, --column COLUMN
                        label of column to use as index for extracted table
-v VALUE [VALUE ...], --value VALUE [VALUE ...]
                        value(s) of specified column in rows to extract

Examples:

python extract.py input.xlsx -c ID > output.csv

python extract.py foo.csv -o bar.csv -c "state fips" -v 01

python extract.py input.csv -o ../output.csv -c Name -v "Rick Astley"

python extract.py in.csv -o out.csv -c NUM -v 0 1 2 3