Skip to content
View jia-gao's full-sized avatar

Block or report jia-gao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
jia-gao/README.md

πŸ‘¨β€πŸ’» About Me

Software engineer at Anthropic, passionate about AI infrastructure and high-performance systems.

πŸš€ Current Focus

  • Scaling LLM inference systems

πŸ“« Connect with Me


πŸ’‘ Open to collaborations on open-source AI infrastructure and ML systems projects!

Popular repositories Loading

  1. kube-gpu-top kube-gpu-top Public

    htop for GPU pods on Kubernetes β€” per-pod GPU utilization, memory, temperature, power, and waste detection

    Go 97 10

  2. leanctx leanctx Public

    Drop-in prompt compression for production LLM apps. Cut your token bill 40-60% without changing your code. Python SDK, LLMLingua-2, MIT.

    Python 1

  3. samza samza Public

    Forked from apache/samza

    Mirror of Apache Samza

    Java

  4. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  5. semantic-router semantic-router Public

    Forked from vllm-project/semantic-router

    Intelligent Mixture-of-Models Router for Efficient LLM Inference

    Python

  6. sglang sglang Public

    Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Python