I am a Research Scientist at Zoom working on Automatic Speech Recognition (ASR).

I obtained Master of Science in Intelligent Information Systems at Carnegie Mellon University Language Technologies Institute. I was fortunate to be advised by Dr. Shinji Watanabe. Under his supervision, I worked on speech processing problems and contributed to ESPnet.

Before that, I obtained my joint Bachelor of Science in Data Science degree at Duke Kunshan University and Duke University.

Publications

Machine Learning

ProjectDescription
espnetContributing to Espnet2. Including the MAGICDATA ASR recipe and Aphasia English ASR recipe for [1]
speech-recognitionA hand-written speech recognition system for English pronunciation of 10 digits using Python+Numpy
asr-ctcImplementation of the Conformer CTC speech recognition architecture
tone_classifierMandarin Tone Classification experiments

Software Engineering

ProjectDescription
tanA compiler for my programming language called tan using LLVM+Clang
tosA toy operating system called TOS that supports paging, APIC, ACPI, VBE console, and keyboard input with a custom libc
NO-tificationsRemove any notifications on Android

Game Dev

ProjectDescription
ExtendedCharacterMovementA Unreal Engine plugin for extended character movement component for FPS/TPS.
tjy_vic3_fixA collection of my Victoria 3 quality-of-life mod
dynamic_road_genA procedural road mesh generator like the one used in Cities:Skylines