ymlei/README.md

Yiming Lei

PhD @ Max Planck Institute for Informatics · Advised by Yiting Xia

I build datacenter systems — from optical datacenter networks and nanosecond-precision time synchronization to fault-tolerant distributed ML training.


Research · Google Scholar

  • 📄 Two first-author papers at NSDI'26: OpenOptics (optical DCN framework) and SyncWise (time synchronization)
  • 📄 HotNets'22 — parallelism-aware flow scheduling for distributed training
  • 📄 Phoenix — checkpoint-less failure recovery for auto-parallelism in JAX/XLA (under submission)

Open Source

  • 🔧 OpenOptics — design, test, and deploy optical DCN architectures in ~10 lines of Python
  • 🔧 SyncWise — error-aware time synchronization for reconfigurable DCNs
  • 🔧 JAX — contributed to JAX's fault tolerance API

Industry

  • 🏢 Applied Scientist Intern @ AWS AI (2024–2025) — resilient distributed training with JAX/XLA

📫 ymlei.github.io · LinkedIn

Pinned

  1. mpi-ncs/openoptics (Public)

    Easy design, testing, and deployment of optical data center networks for everyone.

    Python · 70 stars · 10 forks

  2. mpi-ncs/SyncWise (Public)

    Python

  3. jax (Public)

    Forked from jax-ml/jax

    Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

    Python