Tags

Tags give the ability to mark specific points in history as being important

2.1.8

f9b6b3c9 · Merge branch '23-integrate-serve_features-function-into-gitlabds-package' into 'main' · Feb 03, 2026
```
Add serve_features() functionality
```
2.1.7

f1ba2b07 · Merge branch 'add-multiclass-beeswarm' into 'main' · Aug 22, 2025
```
Add Beeswarm directional graphs for multiclass models
```
2.1.6

daa1e66b · Merge branch 'fix-json-double-double-quote' into 'main' · Aug 14, 2025
```
Fix dummy_apply formatting bug and futurewarning for modelevaluator
```
2.1.5

f04bfe9d · Merge branch 'add-created-at' into 'main' · Jul 16, 2025
```
at created_at field to model monitoring output dfs
```
2.1.4

8e91696c · Merge branch 'fix-scoring_date' into 'main' · Jul 15, 2025
```
Rename scoring_date to score_date in monitoring_metrics for consistency
```
2.1.3

76aab700 · Merge branch 'downgrade-packages' into 'main' · Jul 11, 2025
```
- Downgrade required package dependencies for better compatibility
```

2.1.2

28e659ca · Merge branch 'fix-column-order' into 'main' · Jul 10, 2025

- Fixed model monitoring metrics column order
- Removed duplicate prediction_drift_status

2.1.1

71fe38c5 · Merge branch 'threshold-tweaks' into 'main' · Jul 10, 2025
```
Add configureable prediction thresholds to baseline jsons
```

2.1.0

b3b3dedd · Merge branch 'model-monitoring' into 'main' · Jul 08, 2025

- Added monitoring metrics functions to allow for model observability
- Depreciated `model_metrics`
- Updated testing pipeline to allow for local testing
- Tidied up README

2.0.0

ff00d70e · Merge branch 'gitlabds-2-0-0' into 'main' · May 01, 2025

ModelEvaluator Class
* Comprehensive replacement for the older model_metrics function
* Supports a wide variety of classification and regression models, including multi-class problems
* Extensive visualizations (ROC, PR curves, lift charts, SHAP values)
* Detailed metrics and performance analytics in a single, cohesive interface
* Ability to add custom metrics, save plots, and export metrics to file

Apply Functions
* New functions to consistently transform new data using patterns from training:
    * apply_outliers() - Apply existing outlier limits
    * apply_missing_values() - Apply missing value handling
    * apply_dummy() - Apply existing dummy coding
* These enable production pipelines to use identical transformations to training
* Don't require building a separate .py file for scoring transformations - all transformations are handled directly in the configuration yaml file

ConfigGenerator Class
* Automatically creates scoring configuration files modularly
* Supports nested parameters and complex configuration structures
* Perfect for version controlling your model parameters and preprocessing steps

Memory Optimization
* New memory_optimization() function dramatically reduces DataFrame memory usage
* Significantly reduces time to train XGBoost models by taking advantage of sparse arrays
* Configurable precision modes to balance memory usage vs. numeric precision

Other New Functions Added
* generate_sql_trend_query() - Generate SQL for time-period analysis
* trend_analysis() - Analyze time-series data for patterns
Other Notable Improvements
* Consistent return patterns (functions now return both data AND metadata)
* Standardized function names and improved parameter handling
* More robust outlier detection with skew adjustment options
* Better missing value handling with more filling methods
* Enhanced dummy coding with better prefix handling
* Improved correlation/feature reduction with multiple correlation methods
* Enhanced split_data() with stratification options and better sampling

Breaking Changes
* Many of the calls prior to 2.0.0 will not work correctly without slight modifications. Consult the documentation for exact changes.
* inplace parameters removed from all functions to conform with pandas best practices
* missing_fill() and missing_check() combined into missing_values()
* dv_proxies() renamed to remove_outcome_proxies() for better clarity
* memory_usage() renamed to memory_optimization

1.1.2

00e64262 · Merge branch 'fix-up-sampling' into 'main' · Jan 29, 2025
```
fix `split_data` when upsampling
```
1.1.1

905e93ee · Merge branch 'kdietz-main-patch-56d1' into 'main' · Dec 20, 2024
```
Fix slice and copy warnings in dummy coding and outliers
```
1.1.0

827d43a9 · Merge branch 'kdietz-main-patch-f632' into 'main' · Aug 28, 2024
```
Optimized for Python 3.10 and later
```
1.0.24

c83b3e49 · Merge branch 'kdietz-main-patch-1793' into 'main' · Jul 15, 2024
```
remove numba dependency for python 3.11 compatibility
```
1.0.23

bbf71386 · Merge branch 'panda-compatibility-changes' into 'main' · Feb 20, 2024
```
Pandas 2.0 compatibility
```
1.0.22

8a21332f · Merge branch 'update-model-metrics' into 'main' · Sep 28, 2023
```
Add support to model metrics for linear regression modeling
```
1.0.21

085fbffc · Merge branch 'add-numba-requires' into 'main' · Jun 07, 2023
```
Update numba dependencies
```
1.0.20

c5bf850b · Merge branch 'main-patch-0556' into 'main' · May 10, 2023
```
Output decile breaks from model_metrics
```

1.0.19

e74993c3 · Merge branch 'update-readme' into 'main' · Mar 30, 2023

Improved supported for `model_metrics` outside of XGB and RF. Should work for all scikit-learn models now

1.0.18

5c0de172 · Merge branch 'jstemmer1-main-patch-81704' into 'main' · Dec 15, 2022
```
Minor fixes and performance improvements
```