Metadata-Version: 2.4
Name: datarun
Version: 0.2.6
Summary: A simple data cleansing tool using pandas and Machine learning models
Home-page: https://github.com/arunsundark01/datarun
Author: Arun Sundar K
Author-email: karthicksundar2001@gmail.com
License: MIT
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pandas
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: license-file
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

# datarun

**datarun** is a lightweight Python package that helps you cleanse your pandas DataFrames with minimal configuration.  
It supports automatic handling of duplicates, missing values, constant columns, and type conversion.

## Features

- Drop duplicate rows
- Handle missing values using mean, median, mode, or drop
- Drop constant-value columns
- Convert string-based numeric columns to proper types
- Configurable and simple to use

## Example: Linear Regression

```python
from datarun import LinearRegressionCustom
import pandas as pd

data = pd.read_csv("Salary_dataset.csv")
X = data[['YearsExperience']]
y = data['Salary']

model = LinearRegressionCustom(method='gradient_descent', learning_rate=0.01, epochs=5000)
model.fit(X, y)
preds = model.predict(X)
print(model.get_params())

## Installation

pip install datarun
