Welcome to Tométo Tomato

Tired of messy data breaking your joins?

Tométo Tomato is your command-line tool for fuzzy data integration. It joins disparate datasets on similarity rather than exact matches, so records still line up despite typos or inconsistencies.

Leveraging DuckDB and rapidfuzz, Tométo Tomato makes cleaning and merging your data fast, flexible, and reliable.

Unlock the true potential of your data. Get started now!

Let’s Call the Whole Thing Off!!

The raw data

| city | region |
| --- | --- |
| Cefalu’ | Sicilia |
| Reggio Calabria | CALABRIA |
| RODENGO-SAIANO | Lombardia |

The reference data

| city | region | city_code |
| --- | --- | --- |
| Cefalù | Sicilia | 082027 |
| Reggio di Calabria | Calabria | 080063 |
| Rodengo Saiano | Lombardia | 017164 |

Example fuzzy join result

| city (input) | region (input) | city (ref) | region (ref) | city_code | avg_score |
| --- | --- | --- | --- | --- | --- |
| Cefalu’ | Sicilia | Cefalù | Sicilia | 082027 | 100.0 |
| RODENGO-SAIANO | Lombardia | Rodengo Saiano | Lombardia | 017164 | 98.14 |
| Reggio Calabria | CALABRIA | Reggio di Calabria | Calabria | 080063 | 95.45 |
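To make the idea behind the example concrete, here is a minimal Python sketch of a similarity-based join over the same rows. It is not Tométo Tomato's implementation: the tool builds on DuckDB and rapidfuzz, while this sketch swaps in the standard library's `difflib.SequenceMatcher` so it runs without extra packages. The helper names (`normalize`, `similarity`, `fuzzy_join`), the normalization steps, and the `threshold` value are all assumptions made for illustration, so the scores it produces need not match the `avg_score` column above.

```python
import unicodedata
from difflib import SequenceMatcher

# Rows copied from the tables above.
RAW = [
    ("Cefalu’", "Sicilia"),
    ("Reggio Calabria", "CALABRIA"),
    ("RODENGO-SAIANO", "Lombardia"),
]

REFERENCE = [
    ("Cefalù", "Sicilia", "082027"),
    ("Reggio di Calabria", "Calabria", "080063"),
    ("Rodengo Saiano", "Lombardia", "017164"),
]


def normalize(text):
    """Lowercase, drop accents, and turn punctuation into spaces (assumed cleanup)."""
    decomposed = unicodedata.normalize("NFKD", text)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    cleaned = "".join(ch if ch.isalnum() else " " for ch in no_accents.lower())
    return " ".join(cleaned.split())


def similarity(a, b):
    """Similarity between 0 and 100, computed on the normalized strings."""
    return 100.0 * SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def fuzzy_join(raw_rows, ref_rows, threshold=85.0):
    """For each raw row, keep the reference row with the best average score.

    The score is the mean of the city and region similarities; rows whose
    best score falls below the (hypothetical) threshold are left unmatched.
    """
    results = []
    for city, region in raw_rows:
        scored = [
            ((similarity(city, ref_city) + similarity(region, ref_region)) / 2, code)
            for ref_city, ref_region, code in ref_rows
        ]
        best_score, best_code = max(scored)
        if best_score >= threshold:
            results.append((city, region, best_code, round(best_score, 2)))
    return results


if __name__ == "__main__":
    for row in fuzzy_join(RAW, REFERENCE):
        print(row)
```

Averaging the per-column scores means a strong region match cannot hide a poor city match on its own, and the threshold keeps weak candidates out of the result instead of forcing every row to match something.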

