
Projects & Tools

Open-source scripts and tools I've built for localization, machine translation, and NLP workflows. All are available on GitHub.


A modern, web-based tool for calculating machine learning model performance metrics from confusion matrix values.

HTML, Machine Learning, Metrics
View on GitHub
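
As a rough illustration of the calculation such a tool performs, here is a minimal Python sketch of the standard metrics derived from the four confusion matrix counts. The function name `confusion_metrics` and the choice to return 0.0 for undefined ratios are assumptions made for this example, not details taken from the project.

```python
def confusion_metrics(tp: int, fp: int, fn: int, tn: int) -> dict[str, float]:
    """Compute common classification metrics from confusion matrix counts."""

    def safe_div(numerator: float, denominator: float) -> float:
        # Assumption: report 0.0 when a ratio is undefined (zero denominator).
        return numerator / denominator if denominator else 0.0

    precision = safe_div(tp, tp + fp)
    recall = safe_div(tp, tp + fn)
    return {
        "accuracy": safe_div(tp + tn, tp + fp + fn + tn),
        "precision": precision,
        "recall": recall,
        "specificity": safe_div(tn, tn + fp),
        "f1": safe_div(2 * precision * recall, precision + recall),
    }
```

Calling `confusion_metrics(tp=8, fp=2, fn=2, tn=8)` returns a dictionary with one entry per metric.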

Jupyter Notebook for calculating Levenshtein distance and Post-Edit Distance (PED) metrics.

Jupyter Notebook, NLP, Metrics
View on GitHub
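
For readers unfamiliar with these metrics, the sketch below shows a character-level Levenshtein distance and one common way to normalize it into a post-edit distance. Dividing by the length of the longer string is an assumption of this example; the notebook may normalize differently. The function names are illustrative.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions, and substitutions."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def post_edit_distance(mt_output: str, post_edited: str) -> float:
    """Edit distance normalized to the 0..1 range (assumed normalization)."""
    longest = max(len(mt_output), len(post_edited))
    return levenshtein(mt_output, post_edited) / longest if longest else 0.0
```

Under this normalization, 0.0 means the machine translation was left unchanged and 1.0 means it was rewritten completely.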

A web-based tool for visualizing JSON files with beautiful formatting and interactive tree structure.

HTML, JSON, Visualization
View on GitHub

A Python tool for intelligently replacing rows in CSV files based on matching identifier values.

Python, CSV, Data Processing
View on GitHub
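
The idea of replacing rows by a matching identifier can be sketched as follows. This is a simplified example working on CSV text in memory; the function name `replace_rows`, the `key` parameter, and the behavior of keeping the base file's column order are assumptions, not the tool's actual interface.

```python
import csv
import io


def replace_rows(base_csv: str, updates_csv: str, key: str) -> str:
    """Return base_csv with each row swapped for the update row sharing its key."""
    reader = csv.DictReader(io.StringIO(base_csv))
    # Index the replacement rows by their identifier value.
    updates = {row[key]: row for row in csv.DictReader(io.StringIO(updates_csv))}

    out = io.StringIO()
    writer = csv.DictWriter(
        out,
        fieldnames=reader.fieldnames or [],
        extrasaction="ignore",  # drop columns that exist only in the update file
        lineterminator="\n",
    )
    writer.writeheader()
    for row in reader:
        writer.writerow(updates.get(row[key], row))
    return out.getvalue()
```

Rows whose identifier has no match in the update file pass through unchanged.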

A modern, web-based tool for visualizing CSV files with an intuitive table interface.

HTML, CSV, Visualization
View on GitHub

Modern web-based XLIFF translation file visualizer with side-by-side source/target display.

HTML, XLIFF, Translation
View on GitHub

Python-based script that converts CSV files to properly formatted Excel (.xlsx) files.

Python, CSV, Excel
View on GitHub

Convert TBX (TermBase eXchange) terminology files to Excel format.

Python, TBX, Excel
View on GitHub

Split large Excel files into smaller chunks for easier processing.

Python, Excel, Data Processing
View on GitHub

Machine Translation evaluation tool using Translation Edit Rate (TER) metrics.

Python, MT, Evaluation
View on GitHub
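
To give a sense of what a TER score measures, here is a deliberately simplified sketch: word-level edit distance divided by the reference length. Full TER also counts block shifts as single edits, which this example omits, so it is an approximation and not how the tool necessarily computes its scores. All names are illustrative.

```python
def word_edit_distance(hyp: list[str], ref: list[str]) -> int:
    """Word-level insertions, deletions, and substitutions needed to turn hyp into ref."""
    prev = list(range(len(ref) + 1))
    for i, hw in enumerate(hyp, start=1):
        curr = [i]
        for j, rw in enumerate(ref, start=1):
            cost = 0 if hw == rw else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def ter_without_shifts(hypothesis: str, reference: str) -> float:
    """Approximate TER: edits divided by reference length (shifts not modeled)."""
    hyp, ref = hypothesis.split(), reference.split()
    if not ref:
        # Assumption: an empty reference scores 0.0 only against an empty hypothesis.
        return 0.0 if not hyp else float("inf")
    return word_edit_distance(hyp, ref) / len(ref)
```

Lower is better: a score of 0.0 means the MT output matches the reference word for word.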

Convert JSON files to Excel spreadsheets with proper formatting.

Python, JSON, Excel
View on GitHub

Convert text files to Tab-Separated Values (TSV) format.

Python, TSV, Conversion
View on GitHub

Converts .tmx files into other output formats.

Python, TMX, Translation Memory
View on GitHub

Convert Tab-Separated Values files to Excel format.

Python, TSV, Excel
View on GitHub

Convert Excel files to Tab-Separated Values format.

Python, Excel, TSV
View on GitHub

This script compares two XLIFF files and generates a detailed HTML report highlighting differences.

Python, XLIFF, QA
View on GitHub

This script converts .tmx files to .xliff files, preserving translation units.

Python, TMX, XLIFF
View on GitHub

This script converts pairs of text files containing source and target language segments into a TMX file.

Python, TMX, Translation
View on GitHub
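
A conversion of this kind could look roughly like the sketch below, which pairs up aligned segments and writes one translation unit per pair using only the standard library. The function name `build_tmx`, the default language codes, and the header attribute values are placeholders chosen for the example, not values from the script.

```python
import xml.etree.ElementTree as ET

XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"  # serializes as xml:lang


def build_tmx(sources: list[str], targets: list[str],
              src_lang: str = "en", tgt_lang: str = "es") -> str:
    """Build a minimal TMX 1.4 document from aligned source and target segments."""
    if len(sources) != len(targets):
        raise ValueError("source and target files must have the same number of segments")

    tmx = ET.Element("tmx", version="1.4")
    ET.SubElement(tmx, "header", {
        "creationtool": "example-script",  # placeholder value
        "creationtoolversion": "0.1",      # placeholder value
        "segtype": "sentence",
        "o-tmf": "plain text",
        "adminlang": "en",
        "srclang": src_lang,
        "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")
    for source, target in zip(sources, targets):
        tu = ET.SubElement(body, "tu")
        for lang, text in ((src_lang, source), (tgt_lang, target)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = text
    return ET.tostring(tmx, encoding="unicode")
```

In practice the two input lists would come from reading the source and target text files line by line.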

This script deletes keys and subkeys from a JSON file.

Python, JSON, Editing
View on GitHub
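
One way to remove keys at any nesting depth is a small recursive function over the parsed JSON, sketched below. The name `delete_keys` and the choice to match keys by name wherever they occur are assumptions of this example; the script may target keys differently.

```python
import json


def delete_keys(data, keys_to_delete: set[str]):
    """Return a copy of parsed JSON data without the named keys, at any depth."""
    if isinstance(data, dict):
        return {
            key: delete_keys(value, keys_to_delete)
            for key, value in data.items()
            if key not in keys_to_delete
        }
    if isinstance(data, list):
        return [delete_keys(item, keys_to_delete) for item in data]
    return data  # scalars are returned unchanged


document = json.loads('{"id": 1, "meta": {"id": 2, "note": "keep"}}')
cleaned = delete_keys(document, {"id"})
```

Because the function builds a new structure, the original parsed data is left untouched.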

Copies .srt subtitle content into an Excel file.

Python, SRT, Excel
View on GitHub

This script converts a pair of text files containing translations into XLIFF format.

Python, XLIFF, Translation
View on GitHub

Converts PDF files into .txt files and reports their word count.

Python, PDF, Text Processing
View on GitHub

VBA macro for MS Word that prepares texts in two columns.

VBA, MS Word, Macro
View on GitHub

Unzipper

Active

Utility for batch unzipping archive files.

Python, ZIP, Utility
View on GitHub

This script converts an .xlf file into a two-column .txt file.

Python, XLIFF, Text
View on GitHub
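
The extraction step can be sketched with the standard library as shown below. The example assumes XLIFF 1.2 (hence the namespace URI) and a tab as the column separator; both are assumptions, as is the function name `xliff_to_two_columns`.

```python
import xml.etree.ElementTree as ET

# Assumption: the input is XLIFF 1.2, which uses this namespace.
NS = {"x": "urn:oasis:names:tc:xliff:document:1.2"}


def xliff_to_two_columns(xliff_text: str) -> str:
    """Return one 'source<TAB>target' line per trans-unit."""
    root = ET.fromstring(xliff_text)
    lines = []
    for unit in root.iterfind(".//x:trans-unit", NS):
        source = unit.find("x:source", NS)
        target = unit.find("x:target", NS)
        # itertext() flattens any inline markup inside the segment.
        src = "".join(source.itertext()) if source is not None else ""
        tgt = "".join(target.itertext()) if target is not None else ""
        lines.append(f"{src}\t{tgt}")
    return "\n".join(lines)
```

Units without a target produce an empty second column, which keeps the two columns aligned.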

This script converts .xlsx files to .csv format.

Python, Excel, CSV
View on GitHub

It processes a text file to extract the most common terms, excluding stopwords.

Python, NLP, Text Analysis
View on GitHub
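
Term extraction of this kind usually comes down to tokenizing, filtering, and counting. The sketch below uses a tiny hard-coded English stopword set and a simple letters-only tokenizer; both are stand-ins for whatever stopword list and tokenization the script really uses, and the function name `top_terms` is illustrative.

```python
import re
from collections import Counter

# Assumption: a tiny illustrative stopword list; a real one would be much longer.
STOPWORDS = frozenset({"the", "a", "an", "and", "of", "to", "in", "is", "it", "on"})


def top_terms(text: str, n: int = 10,
              stopwords: frozenset[str] = STOPWORDS) -> list[tuple[str, int]]:
    """Return the n most frequent non-stopword terms with their counts."""
    # Lowercase, then keep runs of letters only (digits and punctuation are dropped).
    words = re.findall(r"[^\W\d_]+", text.lower())
    counts = Counter(word for word in words if word not in stopwords)
    return counts.most_common(n)
```

Passing a language-specific stopword set through the `stopwords` parameter adapts the same function to other languages.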

This script cleans Excel files (.xlsx) by removing rows in which a specified column is empty.

Python, Excel, Cleaning
View on GitHub

This script merges files containing columns from various file formats.

Python, Data Processing, Merging
View on GitHub

Converts a .json file into a .csv file.

Python, JSON, CSV
View on GitHub
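
For a flat JSON array of objects, the conversion can be sketched with the standard library alone. The example assumes the input is a list of non-nested objects and fills missing fields with empty cells; nested data would need flattening first. The function name `json_to_csv` is hypothetical.

```python
import csv
import io
import json


def json_to_csv(json_text: str) -> str:
    """Convert a JSON array of flat objects into CSV text."""
    records = json.loads(json_text)

    # Collect column names in first-seen order across all records.
    fieldnames: list[str] = []
    for record in records:
        for key in record:
            if key not in fieldnames:
                fieldnames.append(key)

    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)  # missing keys become empty cells
    return out.getvalue()
```

Writing the returned text to a file opened with `newline=""` avoids doubled line endings on Windows.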