Hello, I’m Nguyen Phuc Nguyen#

I am a Ph.D. candidate in Systems Engineering at Boston University with a background in Computer Science and Physics. I work on problems that sit at the intersection of software systems, optimization, and applied machine learning, where careful reasoning about state, scalability, and failure modes matters as much as raw performance.

Alongside my academic research, I build independent software projects to explore system design, tooling, and end-to-end implementation in realistic settings.

Education#

  • Ph.D. in Systems Engineering | Boston University (Expected 2026)
  • B.S. in Computer Science & B.A. in Physics | Syracuse University (2021)

Research & Professional Experience#

I am currently a Graduate Research Assistant at the Paschalidis NOC Lab, where I apply machine learning and systems methods to clinical and theoretical problems.

  • Production ML Pipelines
    Designed, built, and deployed an end-to-end ML pipeline trained on a 100,000-patient dataset, now running weekly inference in support of a clinical trial on hypertension prescription at Boston Medical Center.
    I also engineered a data pipeline processing a 10-year dataset (30,000 appointments across 6,000 patients) to predict missed CT screenings and enable targeted patient support.

  • Algorithm Design
    Developed a spectral algorithm for discovering latent policies from demonstrations, guaranteeing global convergence in a single data pass and avoiding common failure modes of Expectation–Maximization methods.

  • Latent Process Modeling
    Contributed to the design and implementation of a flexible framework for discovering latent processes, with applications in bioinformatics (mutation analysis) and remote sensing (hyperspectral unmixing).

Selected Projects#

Mokuro Library#

A self-hosted, multi-user library for reading and editing OCR-backed comics and documents, designed to run on home servers or NAS environments.

This project grew out of a desire to have a reader that treats OCR data as first-class, editable content rather than a static artifact, while remaining practical to self-host and maintain.

Highlights:

  • Server-backed library and storage
    All content and metadata live on the server filesystem, avoiding browser storage limits and enabling access across devices and users.

  • Multi-user design with private state
    Users share a common library while maintaining independent reading progress, bookmarks, and OCR edits.

  • Interactive OCR reader and editor
    Documents are displayed with selectable OCR text overlays, and OCR output can be corrected directly in the browser by editing text and adjusting bounding boxes in place.

  • Non-destructive OCR editing
    OCR changes are tracked as structured edits rather than overwriting files, enabling undo/redo, reset to the original version, and reconciliation with upstream updates.

  • Performance-aware document viewing
    Long documents are rendered using virtualized scrolling to avoid performance and memory issues when viewing hundreds of pages.

  • Reliable export for large archives
    Volumes and collections can be exported as ZIP or PDF files using streaming downloads, avoiding large in-memory buffers and supporting stable downloads for large libraries.

  • Designed for self-hosting
    Runs locally via Docker Compose with minimal configuration, making it suitable for trusted LAN or VPN setups.

Matter Phase Simulation#

A systems-oriented simulation project exploring emergent physical phenomena using Rust.

This project implements a particle simulation with an efficient grid-based method to visualize non-ideal gas behavior, crystalline formation, and annealing processes, using the Bevy engine for visualization.

Quantum Virtual Machine (QVIM)#

An experimental project exploring language design and metaprogramming in Julia.

QVIM provides a domain-specific language for expressing quantum logic circuits with readable syntax, along with a simulator that optimizes gate operations via basis changes.

Technical Skills#

DomainTools & Languages
LanguagesPython, TypeScript, Julia, Rust, Bash, SQL, C++, C, Lua
ML & DataPyTorch, scikit-learn, Hugging Face, pandas, Weights & Biases, cvxpy, Jupyter
Systems & DevDocker, Git, Linux/Unix, Nix, Cargo, Conda, LaTeX
Core CompetenciesData Structures, Algorithms, Probability, Statistics, Optimization, Linear Programming

Connect#