Metadata-Version: 2.1
Name: sibi-dst
Version: 2025.9.6
Summary: Data Science Toolkit
Author: Luis Valverde
Author-email: lvalverdeb@gmail.com
Requires-Python: >=3.11,<4.0
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Requires-Dist: clickhouse-connect (>=0.8.18,<0.9.0)
Requires-Dist: clickhouse-driver (>=0.2.9,<0.3.0)
Requires-Dist: dask[complete] (>=2025.9.0,<2026.0.0)
Requires-Dist: mysqlclient (>=2.2.7,<3.0.0)
Requires-Dist: opentelemetry-exporter-otlp (>=1.35.0,<2.0.0)
Requires-Dist: opentelemetry-sdk (>=1.35.0,<2.0.0)
Requires-Dist: pandas (>=2.3.1,<3.0.0)
Requires-Dist: psycopg2 (>=2.9.10,<3.0.0)
Requires-Dist: pyarrow (>=20.0.0,<21.0.0)
Requires-Dist: pydantic (>=2.11.7,<3.0.0)
Requires-Dist: pyiceberg[hive,s3fs] (>=0.9.1,<0.10.0)
Requires-Dist: pymysql (>=1.1.1,<2.0.0)
Requires-Dist: pyrosm (>=0.6.2,<0.7.0)
Requires-Dist: s3fs (>=2025.5.1,<2026.0.0)
Requires-Dist: sqlalchemy (>=2.0.41,<3.0.0)
Requires-Dist: sse-starlette (>=3.0.2,<4.0.0)
Requires-Dist: tqdm (>=4.67.1,<5.0.0)
Requires-Dist: webdav4 (>=0.10.0,<0.11.0)
Description-Content-Type: text/markdown

### SIBI-DST

Data Science Toolkit built with Python, Pandas, Dask, OpenStreetMaps, NetworkX, SQLAlchemy, GeoPandas, and Folium.

## Example Use Cases

1. **Build DataCubes, DataSets, and DataObjects** from diverse data sources, including **relational databases, Parquet files, Excel (`.xlsx`), delimited tables (`.csv`, `.tsv`), JSON, and RESTful APIs**.
2. **Comprehensive DataFrame Management** utilities for efficient data handling, transformation, and optimization using **Pandas** and **Dask**.
3. **Flexible Data Sharing** with client applications by writing to **Data Warehouses in Clickhouse, local filesystems, and cloud storage platforms** such as **S3**.
4. **Microservices for Data Access** – Build scalable **API-driven services** using **RESTful APIs (`Django REST Framework`, `FastAPI`)** for high-performance data exchange.
5. **Geospatial Analysis** – Utilize **OpenStreetMaps** and **GeoPandas** for advanced geospatial data processing and visualization.

## Supported Technologies

- **Data Processing**: Pandas, Dask
- **Databases & Storage**: SQLAlchemy, Parquet, S3, Clickhouse
- **Mapping & Geospatial Analysis**: OpenStreetMaps, OSMnx, Geopy
- **API Development**: Django REST Framework, FastAPI

## Installation

```bash
# with pip

pip install sibi-dst  # Install only the main package
pip install sibi-dst[geospatial] # Install with geospatial dependencies
pip install sibi-dst[dev,test,geospatial]  # Install all optional dependencies


```

