Metadata-Version: 2.4
Name: cucll-learn
Version: 0.1.2
Summary: One-shot data exploration & cleaning toolkit
Home-page: https://github.com/johnpenuel03/cucll-learn
Author: Penuel John, CUCLL
License: MIT
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.6
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pandas>=1.0
Requires-Dist: numpy>=1.18
Requires-Dist: scipy>=1.4
Requires-Dist: matplotlib>=3.0
Requires-Dist: seaborn>=0.10
Dynamic: author
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: license
Dynamic: license-file
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

# CUCLL.LEARN

One-shot data exploration and cleaning for pandas DataFrames

```python
from cucll.learn import explore_data, clean_data

# Explore
explore_data(df, show_hg=True)

# Clean
cleaned_df = clean_data(df)
```

## Features
- **Smart Exploration**: Shape, dtypes, missing values, outliers, correlations
- **Battle-Tested Cleaning**: Outliers, missing values, duplicates
- **Visual Insights**: Auto-generated histograms & boxplots
- **Production Ready**: Type validation and error handling

## Install
```bash
# pip
pip install cucll.learn

# conda
conda install -c conda-forge cucll.learn
```

## Documentation
- [Quickstart Guide](docs/quickstart.ipynb)
- [API Reference](docs/api.md)

## Examples
```python
# Custom cleaning pipeline
from cucll.learn import handle_outliers, handle_missing

df = handle_outliers(df, threshold=2.5)
df = handle_missing(df, num_method="median")
```

## Examples
- [Quickstart](docs/quickstart.ipynb)
- [Basic Cleaning](examples/basic_cleaning.ipynb)
- [Outlier Detection](examples/outlier_detection.ipynb)
