Projects & Tools
Open source scripts and tools I've built for localization, machine translation, and NLP workflows. All available on GitHub.
View all on GitHubConfusion Matrix Calculator
ActiveA modern, web-based tool for calculating machine learning model performance metrics from confusion matrix values.
View on GitHubPED lev-dist
ActiveJupyter Notebook for calculating Levenshtein distance and Post-Edit Distance (PED) metrics.
View on GitHubJSON Visualizer
ActiveA web-based tool for visualizing JSON files with beautiful formatting and interactive tree structure.
View on GitHubCSV Row Replacer
ActiveA Python tool for intelligently replacing rows in CSV files based on matching identifier values.
View on GitHubCSV Content Visualizer
ActiveA modern, web-based tool for visualizing CSV files with an intuitive table interface.
View on GitHubXLIFF Content Visualizer
ActiveModern web-based XLIFF translation file visualizer with side-by-side source/target display.
View on GitHubCSV to Excel
ActivePython-based script that converts CSV files to properly formatted Excel (.xlsx) files.
View on GitHubTBX to Excel
ActiveConvert TBX (TermBase eXchange) terminology files to Excel format.
View on GitHubExcel Splitter
ActiveSplit large Excel files into smaller chunks for easier processing.
View on GitHubMT Evaluator TER
ActiveMachine Translation evaluation tool using Translation Edit Rate (TER) metrics.
View on GitHubJSON to Excel Converter
ActiveConvert JSON files to Excel spreadsheets with proper formatting.
View on GitHubXLIFF Comparison Tool
ActiveThis script compares two XLIFF files and generates a detailed HTML report highlighting differences.
View on GitHubTMX to XLIFF
ActiveThis script converts .tmx files to .xliff files, preserving translation units.
View on GitHubText File to TMX
ActiveThis script converts pairs of text files containing source and target language segments into a TMX file.
View on GitHubText File to XLIFF
ActiveThis script converts a pair of text files containing translations into xliff format.
View on GitHubPDF to Text and Word Count
ActiveIt converts pdf files into .txt files and provide the word count of the files
View on GitHubFrequent Terms Extractor
ActiveIt processes a text file to extract the most common terms, excluding stopwords.
View on GitHubDelete Rows with Empty Cells
ActiveThis script cleans Excel files (.xlsx) by removing rows with empty cells in a specified column.
View on GitHubMerge Column Files
ActiveThis script merges files containing columns from various file formats.
View on GitHub