This repository contains a Jupyter notebook with sample code covering basic to advanced NLP processes required for working with text.
Updated May 2, 2018 - Jupyter Notebook
Deterministic, offline parser for messy Indian addresses. 26,711 pincodes embedded with OSM centroids, DIGIPIN encode/decode, phonetic alias matching (Gurgaon/Gurugram, Bengaluru/Bangalore), zero required dependencies. Optional opt-in Nominatim geocoding.
Specialized Lightweight Text Matching and Correction Engine
A powerful CLI tool to find exact, fuzzy, and phonetic duplicates in CSV and Excel files with detailed reports.
Cross-language entity resolution engine with pluggable entity types, multi-strategy ensemble scoring, and explainable results
LinguaLink: A robust multilingual duplicate detection engine using semantic, phonetic, and fuzzy signals.
Intelligent recruitment management platform for seasonal operations
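Several of the projects listed above combine exact, phonetic, and fuzzy comparison. As a rough illustration of that idea only, and not the implementation of any listed repository, the sketch below pairs a plain American Soundex code with the standard library's `difflib.SequenceMatcher`. The function names `soundex` and `match_kind` and the `fuzzy_threshold` default of 0.85 are assumptions made for this example.

```python
from difflib import SequenceMatcher

# Soundex digit for each consonant group; vowels, H, W and Y have no digit.
_CODES = {}
for _letters, _digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word):
    """Return the 4-character American Soundex code, or "" if no letters."""
    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ""
    code = letters[0]
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same digit
        digit = _CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
        prev = digit  # a vowel resets prev, so a repeated digit is kept
    return (code + "000")[:4]


def match_kind(a, b, fuzzy_threshold=0.85):
    """Classify a pair as 'exact', 'phonetic', 'fuzzy', or 'none'."""
    x, y = a.strip().lower(), b.strip().lower()
    if x == y:
        return "exact"
    if soundex(x) and soundex(x) == soundex(y):
        return "phonetic"
    # Hypothetical cutoff; tune it on real data before relying on it.
    if SequenceMatcher(None, x, y).ratio() >= fuzzy_threshold:
        return "fuzzy"
    return "none"


if __name__ == "__main__":
    for pair in [("Bengaluru", "Bangalore"), ("Coimbatore", "Koimbatore")]:
        print(pair, match_kind(*pair))
```

Soundex keeps the first letter as-is, so spellings that differ in their initial letter fall through to the fuzzy check. It also does not unify every alias pair: Gurgaon and Gurugram receive different codes, which is one reason tools in this space may rely on explicit alias tables or other phonetic schemes instead.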