Documentation: gdutils

A collection of geodata utility tools.

Available modules:

  • gdutils.datamine: a Python module for mining and listing data sources.
  • gdutils.dataqa: a Python module for analyzing and comparing data for QA purposes.
  • gdutils.extract: a script and Python module for extracting tabular data for data science (data wrangling) purposes. A user-friendly, lightweight wrapper around geopandas.

gdutils.datamine

Provides
  • A Python module for mining and listing data sources.

Metadata

Module: gdutils.datamine
Filename: datamine.py
Author: @KeiferC
Date: 27 July 2020
Version: 1.0.0
Description: Module for data mining

Documentation

Documentation for the datamine module can be found as docstrings. Run import gdutils.datamine; help(gdutils.datamine) to view documentation.

$ python
>>> import gdutils.datamine; help(gdutils.datamine)
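The same docstrings can also be captured as a string instead of being paged interactively, for instance to search them or save them to a file. The sketch below uses only the standard library's pydoc and importlib modules; the helper name module_help is hypothetical and is not part of gdutils. It assumes nothing about the contents of the gdutils docstrings and simply reports when a module is not installed.

```python
import importlib
import pydoc


def module_help(name: str) -> str:
    """Return the help text of a module as plain text.

    `module_help` is a hypothetical helper, not a gdutils function.
    If the module cannot be imported, a short notice is returned instead.
    """
    try:
        module = importlib.import_module(name)
    except ImportError:
        return f"{name} is not installed"
    # pydoc.plaintext renders without terminal bold/underline escape codes.
    return pydoc.render_doc(module, renderer=pydoc.plaintext)


if __name__ == "__main__":
    # Works for any importable module, e.g. "gdutils.datamine" once installed.
    print(module_help("gdutils.datamine"))
```

The same call works for gdutils.dataqa and gdutils.extract by changing the module name.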

Additionally, documentation can be found on Read the Docs.

gdutils.dataqa

Provides
  • A Python module containing data quality assurance functions.

Metadata

Module: gdutils.dataqa
Filename: dataqa.py
Author: @KeiferC
Date: 14 August 2020
Version: 1.0.0
Description: Module for data quality assurance

Dependencies:
  • geopandas
  • gdutils.extract
  • pandas

Documentation

Documentation for the dataqa module can be found as docstrings. Run import gdutils.dataqa; help(gdutils.dataqa) to view documentation.

$ python
>>> import gdutils.dataqa; help(gdutils.dataqa)

Additionally, documentation can be found on Read the Docs.

gdutils.extract

Provides
  • A class ExtractTable (pronounced “extractable”) for extracting subtables from given tabular data. It handles file types such as .csv, .xlsx, .geojson, and .shp.

  • A command-line script that can be used to

    1. convert input filetype to output filetype (e.g. .shp -> .csv);
    2. output tabular data reindexed with a specified column label; or
    3. output subtables from input tabular data.
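To make the three operations above concrete, here is a minimal sketch of what converting, reindexing, and extracting a subtable mean for a small table. It deliberately uses only Python's standard csv module rather than gdutils or geopandas, so it does not show the ExtractTable API or the script's flags; the sample data, column names, and helper functions are all invented for illustration.

```python
import csv
import io

# Hypothetical sample data; the column names are invented for this sketch.
SAMPLE_CSV = """county,state,population
Addison,VT,100
Bennington,VT,200
Belknap,NH,300
"""


def read_rows(text):
    """Parse CSV text into a list of dicts, one per row."""
    return list(csv.DictReader(io.StringIO(text)))


def reindex(rows, column):
    """Key each row by its value in `column`, like reindexing a table."""
    return {row[column]: row for row in rows}


def subtable(rows, column, values):
    """Keep only the rows whose `column` value is one of `values`."""
    wanted = set(values)
    return [row for row in rows if row[column] in wanted]


def to_tsv(rows):
    """Write rows as tab-separated text, a simple stand-in for format conversion."""
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=list(rows[0]), delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


rows = read_rows(SAMPLE_CSV)
by_county = reindex(rows, "county")         # operation 2: reindex by a column label
vt_only = subtable(rows, "state", ["VT"])   # operation 3: extract a subtable
vt_as_tsv = to_tsv(vt_only)                 # operation 1: convert to another format
```

The actual module wraps geopandas, so it can also read geospatial formats such as .shp and .geojson, which this standard-library sketch cannot.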

Metadata

Module: gdutils.extract
Filename: extract.py
Author: @KeiferC
Date: 06 July 2020
Version: 1.0.0
Description: Script and module to extract subtables from given tabular data

Dependencies:
  • geopandas
  • numpy
  • pandas

Documentation

Documentation for the extract module can be found as docstrings. Run import gdutils.extract; help(gdutils.extract) to view documentation.

$ python
>>> import gdutils.extract; help(gdutils.extract)

Additionally, documentation can be found on Read the Docs.

Script Usage

To get help on using the extract.py script, run it with the -h flag.

$ python extract.py -h