- California
-
20:46
(UTC -07:00)
Pinned Loading
-
vllm-project/vllm-omni
vllm-project/vllm-omni PublicA framework for efficient model inference with omni-modality models
-
vllm-project/llm-compressor
vllm-project/llm-compressor PublicTransformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
-
On-Device-Agent-for-adaptive-display-optimization
On-Device-Agent-for-adaptive-display-optimization PublicWe present a novel on-device hybrid agent combining LLMs with retrieval-augmented generation for real-time display optimization. The system achieves 92% accuracy with CoreML acceleration delivering…
Swift 1
-
ARS-Adaptive-Reasoning-Suppression-for-Efficient-Large-Reasoning-Language-Models
ARS-Adaptive-Reasoning-Suppression-for-Efficient-Large-Reasoning-Language-Models PublicAdaptive Reasoning Suppression for Efficient Large Reasoning Language Models
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
