Projects

Projects

Selected research, systems, and prototypes I've built or led.

Sort:
28 of 28 total results

BenchING: Structured Output Benchmark for LLMs

A benchmark and framework for evaluating how well LLMs follow structured output formats in narrative PCG tasks, with error taxonomy and scaling analysis.

Research
2025
LLM Evaluation
Prompt Engineering
PCG
Benchmark

OpenRLHF (Contributor)

Contributed to OpenRLHF, an easy-to-use, scalable, and high-performance RLHF framework.

Open Source
2025
RLHF
Open Source
Framework

Thai Earthquake Timeline Visualizer

Interactive timeline visualizer for Thailand earthquake events with filtering and temporal context.

Side Project
2025
Visualization
TypeScript
Open Data

Themis

Lightweight evaluation platform for LLM experiments.

Open Source
2025
LLM Evaluation
Tooling

Thoth

A GUI program for labeling datasets.

Open Source
2025
Data Labeling
GUI
Tooling

Typhoon Application Week

Built and shipped 7+ web apps integrating LLM capabilities as part of a rapid prototyping initiative.

Initiative
2025
LLM Applications
Rapid Prototyping
Typhoon
Showing 6 of 28
Per page:
1/5