This package contains a set of Python utilities for :
- easy data loading ;
- descriptive statistics iteration ;
- implementation of econometric methods.
The repository is organized as follow :
- the
s3module contains a set of functions for importing or exporting files in various formats from or to an s3 bucket on the Nubonyxia platform; - the
stats_desmodule contains functions to complement those integrated by default in pandas for descriptive statistics (addition of totals, weighted statistics, verification of statistical confidentiality, etc.). - the
preprocessingmodule contains a set of classes that can be integrated into asklearn.pipelineto perform various data transformation operations - the
model_selectionmodule contains functions for training a prediction model or estimating a regression model - the
estimatorsmodule contains econometric models that can be integrated into asklearn.pipeline - the
samplesmodule contains methods for sample weighting and statistical inference - the
economicsmodule contains functions to compute metrics, for industrial economics and concentration for instance - the
utilsmodule contains a set of utility functions on which other functions in this module depend, or for calculating weighted statistics, for example
git clone <repo_url>
pip install -e igf_toolboxThe package can then be used like any other Python package.
To visualize the documentation :
mkdocs build --port 5000
The package is licensed under the MIT License.