Metadata-Version: 2.4
Name: datainsightx-raj
Version: 0.1.1
Summary: A lightweight data quality and visualization toolkit for pandas DataFrames
Home-page: https://github.com/TRajalingam/datainsightx-raj
Author: RAJALINGAMT
Author-email: raju031001@gmail.com
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pandas
Requires-Dist: numpy
Requires-Dist: plotly
Requires-Dist: jinja2
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: license-file
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

#  DataInsightX-Raj

**A lightweight, beginner-friendly Python library for automated data quality checks and interactive visualizations.**

---

##  Overview

`DataInsightX-Raj` helps **data engineers, analysts, and students** quickly assess and visualize the quality of their datasets.

With just one command or a few lines of code, you can:
- Detect **missing values**  
- Find **duplicate rows**  
- Validate **schema consistency**  
- Identify **data drift** between datasets  
- Generate an **interactive dashboard** (Plotly + HTML)

---

##  Why This Project?

I built this library to simulate a **real-world data engineering task** — validating and profiling data before analysis or model training.

This project demonstrates:
- Python packaging and CLI development  
- Data validation and visualization skills  
- Open-source best practices (README, tests, PyPI readiness)  

 Great to showcase on a **fresher data engineer resume**!

---

##  Features

| Category | Feature | Description |
|-----------|----------|-------------|
| **Data Quality** | Missing Value Report | Identify missing values and their percentages |
|  | Duplicate Detection | Detect duplicate rows |
|  | Schema Validation | Check if data matches expected structure |
|  | Data Drift | Compare statistics between two datasets |
| **Visualization** | Automated Dashboard | Generate interactive HTML reports |
| **CLI Tool** | `datainsightx analyze file.csv` | Run full analysis from the terminal |

---

##  Installation

Install the library from PyPI:

```bash
pip install datainsightx-raj
