Metadata-Version: 2.4
Name: hintify
Version: 0.1.2
Summary: SnapAssist AI – real-time clipboard-to-hints assistant
Author: Hintify Contributors
License-Expression: MIT
Project-URL: Homepage, https://github.com/AryanVBW/Hintify
Keywords: education,ocr,ai,ollama,gemini,hints
Classifier: Programming Language :: Python :: 3
Classifier: Operating System :: OS Independent
Classifier: Intended Audience :: Education
Classifier: Topic :: Education
Requires-Python: >=3.9
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pillow
Requires-Dist: pytesseract
Requires-Dist: keyring
Requires-Dist: requests
Requires-Dist: google-generativeai
Requires-Dist: pynput
Dynamic: license-file

<div align="center">
  <img src="logo.png" alt="Hintify" height="160">
  <h1>Hintify</h1>
  <p><b>Take a screenshot → get gentle hints (not answers).</b></p>
</div>
<br>

SnapAssist AI is a **real-time AI-powered study assistant** that helps students solve problems independently by giving **progressive hints instead of direct answers**.

Take a **screenshot of a question** (assignments, PDFs, exams, etc.). SnapAssist will:

- Detect the screenshot from the clipboard
- Extract text via OCR (Tesseract)
- Classify the question (MCQ or Descriptive) and estimate difficulty
- Query an AI model (Ollama locally, or Gemini via API) to generate stepwise hints
- Show hints in a simple GUI, or print them in headless mode

---

## ✨ Features

- 📸 **Cross-platform screenshot detection** – Uses clipboard on macOS, Windows, and Linux
- 🔎 **OCR with Tesseract** – Robust text extraction via `pytesseract`
- 🧩 **Question classification** – MCQ vs. descriptive
- ⚖️ **Difficulty estimation** – Easy / Medium / Hard
- 🤖 **LLM provider selection** – Local via Ollama, or cloud via Gemini
- ⬇️ **Auto-pull Ollama model** – Ensures the selected model is available before first use
- 🔐 **Keychain storage** – Saves Gemini API key securely via `keyring` (one-time setup)
- 🖥️ **GUI or headless** – Tkinter GUI if available, otherwise prints to console
- Beautiful, distraction-free UI with styled hints (bold labels, soft colors)

---

## 🛠 Tech Stack

**Languages & Libraries**

- `Python`
- `pillow` (Image handling, clipboard grab)
- `pytesseract` (OCR)
- `tkinter` (optional GUI)
- `keyring` (secure credential storage)
- `google-generativeai` (Gemini API)

**External Tools**

- **Tesseract OCR** → Text recognition
- **Ollama** → Local AI inference (default model `granite3.2-vision:2b`)

---

## 📦 Installation (Users)

Install from PyPI (recommended):
```
pip install -U hintify
```
Then start:
```
hintify
```

2) Install Tesseract (required)
- macOS: `brew install tesseract`
- Windows: `choco install tesseract` (or download from the UB Mannheim builds)
- Linux: `sudo apt-get install tesseract-ocr` (or your distro equivalent)

3) Option A: Local via Ollama (recommended for offline)
- Install Ollama from: `https://ollama.com/download`
- The app will auto-pull the model on first run

If you prefer source install:
```
pip install -r requirements.txt
python hintify.py
```

4) Option B: Gemini API (cloud)
- Get an API key from Google AI Studio: [`https://aistudio.google.com/apikey`](https://aistudio.google.com/apikey)
- The app will offer to open the page and save your key securely on first run

---

## 🚀 Quickstart

Run from terminal after install:
```
hintify
```

Hotkeys:
- macOS (global): Cmd+Shift+H to capture (may require Accessibility permission)
- Windows (global): Ctrl+Shift+H to capture (opens Snipping Tool)
- In-app: press 'c' while the window is focused, or click Capture

Headless mode (no tkinter installed):
```
python hintify.py --no-gui
```

Take a screenshot and copy it to the clipboard. Hints will appear automatically.

- GUI extras:
  - Click the "Capture (press 'c')" button or press 'c' while the window is focused to select an area and process it immediately (macOS).

---

## ⚙️ Configuration

CLI options:
```
python hintify.py \
  --provider {ollama|gemini} \
  --ollama-model granite3.2-vision:2b \
  --gemini-model gemini-2.0-flash \
  --poll-interval 1.5 \
  --no-gui
```

Environment variables:
- `HINTIFY_PROVIDER` – Force provider (`ollama` or `gemini`)
- `HINTIFY_OLLAMA_MODEL` – Ollama model ID (default `granite3.2-vision:2b`)
- `GEMINI_API_KEY` – Gemini API key
- `GEMINI_MODEL` – Gemini model ID (default `gemini-2.0-flash`; auto-fallback to `gemini-1.5-flash`)

Provider behavior:
- If Ollama is installed, the app uses Ollama and auto-pulls the model if missing
- If Ollama is not installed, the app offers Gemini setup, opens the API key page, and saves the key in the system keychain
- If both are available, you can select via `--provider` or `HINTIFY_PROVIDER`

Key storage:
- Gemini key is stored securely with `keyring` (`service` = `hintify`, `username` = `gemini_api_key`)
- To clear the saved key:
```
python -c "import keyring; keyring.delete_password('hintify','gemini_api_key')"
```

---

## 🔄 How it Works

1. Monitor clipboard for images
2. OCR with Tesseract
3. Classify question type and difficulty
4. Build a “hints-only” prompt
5. Query Ollama or Gemini
6. Show 3–5 progressive hints (concept → formula → setup → approach → nudge)

---

## 🧰 Troubleshooting

- Tesseract not found: install it and ensure `tesseract` is on PATH (see Installation)
- Clipboard returns `None` on Linux: ensure a desktop environment/clipboard manager is running
- Ollama not found: install from `https://ollama.com/download`
- Gemini 2.5 model not available to your account: the app auto-falls back to `gemini-1.5-flash`

---

## 📌 Project Highlights

- 🔄 Real-time monitoring without user clicks
- 🖼 Optional GUI that doesn’t interrupt workflow
- 📚 Learning-first – Encourages solving, not spoon-feeding
- 🔗 Cross-cutting design – OCR, LLMs, GUI/headless
- 🛠 Extensible – Swap models, add features, redesign UI easily
