Projects & Tools

Open-source scripts and tools I've built for localization, machine translation, and NLP workflows. All are available on GitHub.

Confusion Matrix Calculator

Active

A modern, web-based tool for calculating machine learning model performance metrics from confusion matrix values.

HTML, Machine Learning, Metrics
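
The kind of calculation such a tool performs can be sketched in a few lines of Python. This is only an illustration, assuming a binary confusion matrix given as four counts; the function name `confusion_metrics` and the choice of metrics are hypothetical and not taken from the project itself.

```python
def confusion_metrics(tp: int, fp: int, fn: int, tn: int) -> dict[str, float]:
    """Derive common metrics from binary confusion matrix counts (illustrative)."""
    total = tp + fp + fn + tn

    def ratio(numerator: float, denominator: float) -> float:
        # Return 0.0 instead of raising when a denominator is zero.
        return numerator / denominator if denominator else 0.0

    precision = ratio(tp, tp + fp)
    recall = ratio(tp, tp + fn)
    return {
        "accuracy": ratio(tp + tn, total),
        "precision": precision,
        "recall": recall,
        "specificity": ratio(tn, tn + fp),
        "f1": ratio(2 * precision * recall, precision + recall),
    }


metrics = confusion_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Returning 0.0 for undefined ratios is one possible convention; a real tool might prefer to report them as "n/a".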

PED lev-dist

Active

Jupyter Notebook for calculating Levenshtein distance and Post-Edit Distance (PED) metrics.

Jupyter Notebook, NLP, Metrics
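
For readers unfamiliar with the two metrics, here is a minimal sketch of character-level Levenshtein distance and a normalized post-edit distance. PED is defined differently across tools; dividing by the length of the longer string is an assumption made here, and the function names are hypothetical rather than the notebook's own.

```python
def levenshtein(a: str, b: str) -> int:
    """Character-level edit distance (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, char_a in enumerate(a, start=1):
        curr = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def post_edit_distance(mt_output: str, post_edited: str) -> float:
    """Edit distance normalized by the longer string (assumed normalization)."""
    longest = max(len(mt_output), len(post_edited))
    return levenshtein(mt_output, post_edited) / longest if longest else 0.0


distance = levenshtein("kitten", "sitting")
ped = post_edit_distance("kitten", "sitting")
print(distance, round(ped, 3))
```

A PED of 0.0 means the machine translation was left untouched, and values near 1.0 mean it was almost entirely rewritten.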

JSON Visualizer

Active

A web-based tool for visualizing JSON files with beautiful formatting and interactive tree structure.

HTML, JSON, Visualization

CSV Row Replacer

Active

A Python tool for intelligently replacing rows in CSV files based on matching identifier values.

Python, CSV, Data Processing
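
The idea of replacing rows by a matching identifier could look roughly like the sketch below. It assumes both files share a header and an identifier column (called `id` here); the function name `replace_rows` and the sample data are hypothetical and do not describe the tool's actual interface.

```python
import csv
import io


def replace_rows(base_rows, update_rows, key):
    """Return base_rows with any row swapped for the update that shares its key."""
    updates = {row[key]: row for row in update_rows}
    return [updates.get(row[key], row) for row in base_rows]


# In-memory stand-ins for two CSV files with the same header.
base = list(csv.DictReader(io.StringIO("id,text\n1,Hello\n2,World\n")))
updates = list(csv.DictReader(io.StringIO("id,text\n2,Mundo\n")))

merged = replace_rows(base, updates, key="id")
for row in merged:
    print(row["id"], row["text"])
```

Rows whose identifier has no match in the update file are kept unchanged, and the original row order is preserved.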

CSV Content Visualizer

Active

A modern, web-based tool for visualizing CSV files with an intuitive table interface.

HTML, CSV, Visualization

XLIFF Content Visualizer

Active

Modern web-based XLIFF translation file visualizer with side-by-side source/target display.

HTML, XLIFF, Translation

CSV to Excel

Active

Python-based script that converts CSV files to properly formatted Excel (.xlsx) files.

Python, CSV, Excel

TBX to Excel

Active

Convert TBX (TermBase eXchange) terminology files to Excel format.

Python, TBX, Excel

Excel Splitter

Active

Split large Excel files into smaller chunks for easier processing.

Python, Excel, Data Processing

MT Evaluator TER

Active

Machine Translation evaluation tool using Translation Edit Rate (TER) metrics.

Python, MT, Evaluation
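
As a rough illustration of the metric, TER divides the number of edits needed to turn the MT output into the reference by the number of reference words. Full TER also counts a block shift as a single edit; the sketch below leaves shifts out, so it is a simplified approximation, and its function names are hypothetical rather than the tool's own.

```python
def word_edit_distance(hyp: list[str], ref: list[str]) -> int:
    """Word-level insertions, deletions and substitutions (no shifts)."""
    prev = list(range(len(ref) + 1))
    for i, hyp_word in enumerate(hyp, start=1):
        curr = [i]
        for j, ref_word in enumerate(ref, start=1):
            cost = 0 if hyp_word == ref_word else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def simplified_ter(hypothesis: str, reference: str) -> float:
    """Edits divided by reference length; shift operations are NOT modeled."""
    hyp, ref = hypothesis.split(), reference.split()
    if not ref:
        # Convention assumed here: an empty reference scores 0 only for empty output.
        return 0.0 if not hyp else float("inf")
    return word_edit_distance(hyp, ref) / len(ref)


score = simplified_ter("the cat sat on mat", "the cat sat on the mat")
print(round(score, 3))
```

Lower is better: 0.0 means the output matches the reference exactly. Because shifts are ignored, this sketch can only overestimate true TER for reordered output.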

JSON to Excel Converter

Active

Convert JSON files to Excel spreadsheets with proper formatting.

Python, JSON, Excel

Text File to TSV

Active

Convert text files to Tab-Separated Values (TSV) format.

Python, TSV, Conversion

TMX Converter

Active

Converts .tmx (Translation Memory eXchange) files into other output formats.

Python, TMX, Translation Memory

TSV to XLSX

Active

Convert Tab-Separated Values files to Excel format.

Python, TSV, Excel

XLSX to TSV

Active

Convert Excel files to Tab-Separated Values format.

Python, Excel, TSV

XLIFF Comparison Tool

Active

This script compares two XLIFF files and generates a detailed HTML report highlighting differences.

Python, XLIFF, QA

TMX to XLIFF

Active

This script converts .tmx files to .xliff files, preserving translation units.

Python, TMX, XLIFF

Text File to TMX

Active

This script converts pairs of text files containing source and target language segments into a TMX file.

Python, TMX, Translation
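
A conversion of this kind might be sketched with the standard library as follows. It assumes the source and target segments are already aligned line by line; the function name `build_tmx`, the header values, and the language codes are placeholders chosen for illustration, not details of the actual script.

```python
import xml.etree.ElementTree as ET

# ElementTree serializes this namespaced attribute as xml:lang.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def build_tmx(source_lines, target_lines, src_lang, tgt_lang):
    """Pair aligned segments into a minimal TMX 1.4 tree (illustrative)."""
    if len(source_lines) != len(target_lines):
        raise ValueError("source and target must have the same number of segments")

    tmx = ET.Element("tmx", version="1.4")
    ET.SubElement(tmx, "header", {
        "creationtool": "example-sketch",  # placeholder values
        "creationtoolversion": "0.1",
        "segtype": "sentence",
        "o-tmf": "plain-text",
        "adminlang": "en",
        "srclang": src_lang,
        "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")
    for src, tgt in zip(source_lines, target_lines):
        tu = ET.SubElement(body, "tu")  # one translation unit per line pair
        for lang, text in ((src_lang, src), (tgt_lang, tgt)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = text
    return ET.ElementTree(tmx)


tree = build_tmx(["Hello", "Goodbye"], ["Hola", "Adiós"], "en", "es")
xml_text = ET.tostring(tree.getroot(), encoding="unicode")
print(xml_text)
```

To write a file instead of a string, `tree.write("output.tmx", encoding="utf-8", xml_declaration=True)` would be one option.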

JSON Keys Editor

Active

This script deletes keys and subkeys from a JSON file.

Python, JSON, Editing
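
Deleting a key or nested subkey can be illustrated with a short sketch. Addressing subkeys with a dotted path such as `meta.draft` is an assumption made for this example, and `delete_key` is a hypothetical helper, not the script's real interface.

```python
import json


def delete_key(data: dict, dotted_path: str) -> bool:
    """Delete the key addressed by a dotted path; return True if it existed."""
    *parents, leaf = dotted_path.split(".")
    node = data
    for name in parents:
        # Walk down through nested objects, stopping if the path breaks.
        node = node.get(name) if isinstance(node, dict) else None
        if node is None:
            return False
    if isinstance(node, dict) and leaf in node:
        del node[leaf]
        return True
    return False


doc = json.loads('{"title": "Demo", "meta": {"author": "x", "draft": true}}')
removed = delete_key(doc, "meta.draft")
print(removed, json.dumps(doc))
```

A dotted path cannot address keys that themselves contain a dot; a list of path components would avoid that limitation.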

SRT to Excel

Active

Copies .srt subtitle content into an Excel file.

Python, SRT, Excel

Text File to XLIFF

Active

This script converts a pair of text files containing translations into XLIFF format.

Python, XLIFF, Translation

PDF to Text and Word Count

Active

Converts PDF files into .txt files and provides their word count.

Python, PDF, Text Processing

PrepareIn2Columns

Active

VBA macro for MS Word that prepares texts in two columns.

VBA, MS Word, Macro

Unzipper

Active

Utility for batch unzipping archive files.

Python, ZIP, Utility

XLF to TXT

Active

This script converts an .xlf file into a two-column .txt file.

Python, XLIFF, Text

XLSX to CSV

Active

This script converts .xlsx files to .csv format.

Python, Excel, CSV

Frequent Terms Extractor

Active

It processes a text file to extract the most common terms, excluding stopwords.

Python, NLP, Text Analysis
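
A minimal sketch of this kind of term extraction is shown below, assuming simple regex tokenization and a tiny hand-written stopword set. Both are illustrative choices; a real script would likely use a fuller stopword list, and `frequent_terms` is a hypothetical name.

```python
import re
from collections import Counter

# Deliberately tiny stopword set, for illustration only.
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "it"}


def frequent_terms(text: str, top_n: int = 3) -> list[tuple[str, int]]:
    """Return the top_n most common non-stopword terms with their counts."""
    # Match runs of letters (Unicode-aware), skipping digits and underscores.
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    counts = Counter(token for token in tokens if token not in STOPWORDS)
    return counts.most_common(top_n)


sample = ("The translation memory stores the translation of a segment "
          "and the segment source.")
top = frequent_terms(sample, top_n=2)
print(top)
```

Lowercasing before counting merges "The" and "the"; whether that is desirable depends on the language and on whether proper nouns matter.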

Delete Rows with Empty Cells

Active

This script cleans Excel files (.xlsx) by removing rows with empty cells in a specified column.

Python, Excel, Cleaning

Merge Column Files

Active

This script merges files containing columns from various file formats.

Python, Data Processing, Merging

Convert JSON to CSV

Active

Converts a .json file into a .csv file.

Python, JSON, CSV