Projects

Projects

Selected research, systems, and prototypes I've built or led.

Sort:
29 of 29 total results

Typhoon-Si Med-Thinking 4B

A 4B medical reasoning model from Typhoon and SiData+ that generates ranked diagnoses, capturing clinical uncertainty and outperforming larger models on major medical QA benchmarks.

Research
2026
Large Language Models
Reasoning
Medical
Open Source

BenchING: Structured Output Benchmark for LLMs

A benchmark and framework for evaluating how well LLMs follow structured output formats in narrative PCG tasks, with error taxonomy and scaling analysis.

Research
2025
LLM Evaluation
Prompt Engineering
PCG
Benchmark

OpenRLHF (Contributor)

Contributed to OpenRLHF, an easy-to-use, scalable, and high-performance RLHF framework.

Open Source
2025
RLHF
Open Source
Framework

Thai Earthquake Timeline Visualizer

Interactive timeline visualizer for Thailand earthquake events with filtering and temporal context.

Side Project
2025
Visualization
TypeScript
Open Data

Themis

Lightweight evaluation platform for LLM experiments.

Open Source
2025
LLM Evaluation
Tooling

Thoth

A GUI program for labeling datasets.

Open Source
2025
Data Labeling
GUI
Tooling
Showing 6 of 29
Per page:
1/5