Issue Metrics (2025-11-01..2025-11-30)
Summary
- Total issues/PRs: 27
- Closed issues/PRs: 27
- Average time to close: -575219 days, 18:45:46
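The negative average above is not a plausible duration. One possible cause is a missing or placeholder timestamp entering the subtraction. As a minimal sketch, the average can be computed so that only items with a valid, non-inverted pair of timestamps are counted. The field names `created_at` and `closed_at`, the helpers `parse_ts` and `average_time_to_close`, and the sample records are all hypothetical and are not taken from the tool that produced this report.

```python
from datetime import datetime, timedelta, timezone
from typing import Dict, List, Optional


def parse_ts(value: Optional[str]) -> Optional[datetime]:
    """Parse an ISO 8601 timestamp such as '2025-11-07T21:35:30Z' as aware UTC."""
    if not value:
        return None
    # datetime.fromisoformat() rejects a trailing 'Z' before Python 3.11.
    parsed = datetime.fromisoformat(value.replace("Z", "+00:00"))
    if parsed.tzinfo is None:
        # Assumption: naive timestamps are treated as UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed


def average_time_to_close(
    items: List[Dict[str, Optional[str]]]
) -> Optional[timedelta]:
    """Average (closed - created), skipping missing or inverted timestamps."""
    durations = []
    for item in items:
        created = parse_ts(item.get("created_at"))
        closed = parse_ts(item.get("closed_at"))
        if created is None or closed is None or closed < created:
            continue  # do not let placeholder values skew the average
        durations.append(closed - created)
    if not durations:
        return None
    return sum(durations, timedelta()) / len(durations)


# Illustrative records only; these are not the issues listed in this report.
sample = [
    {"created_at": "2025-11-01T00:00:00Z", "closed_at": "2025-11-03T00:00:00Z"},
    {"created_at": "2025-11-10T12:00:00Z", "closed_at": "2025-11-10T18:00:00Z"},
    {"created_at": "2025-11-20T00:00:00Z", "closed_at": None},  # skipped
]
print(average_time_to_close(sample))  # prints "1 day, 3:00:00"
```

Returning `None` when no item qualifies keeps "no data" distinct from a zero-length average.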
Issues and Pull Requests
- [BUG] CUDA agent segfault at exit caused by watcher thread — eunomia-bpf/bpftime #521 (✅ Closed)
- [FEATURE] — eunomia-bpf/bpftime #519 (✅ Closed)
- [FEATURE] Automatically detect and set `BPFTIME_SM_ARCH` based on device — eunomia-bpf/bpftime #516 (✅ Closed)
- Weekly Org Report (2025-11-10..2025-11-16) — eunomia-bpf/eunomia.dev #55 (✅ Closed)
- [FEATURE] Collect all gpu macro definitions into one readme docs — eunomia-bpf/bpftime #509 (✅ Closed)
- Best practices for real-time streaming of eBPF SSL/TLS capture data in Kubernetes? — eunomia-bpf/bpf-developer-tutorial #187 (✅ Closed)
- Outdated README file — eunomia-bpf/bpftime #508 (✅ Closed)
- the collector program fails to load the eBPF object,error is R2 min value is negative — eunomia-bpf/agentsight #20 (✅ Closed)
- [FEATURE] Reduce the cost of cudaHostRegister to improve performance — eunomia-bpf/bpftime #506 (✅ Closed)
- [FEATURE] Add Per-thread runtime distribution example for bpftime GPU tracing — eunomia-bpf/bpftime #505 (✅ Closed)
- [FEATURE] Add SM / warp / lane mapping example for bpftime GPU tracing — eunomia-bpf/bpftime #504 (✅ Closed)
- [FEATURE] setTargetTriple API change for LLVM ≥15 — eunomia-bpf/llvmbpf #36 (✅ Closed)
- Weekly Org Report (2025-11-03..2025-11-09) — eunomia-bpf/eunomia.dev #54 (✅ Closed)
- [FEATURE] Use gdrcopy to implement high performance maps — eunomia-bpf/bpftime #500 (✅ Closed)
- Weekly Org Report (2025-10-27..2025-11-02) — eunomia-bpf/eunomia.dev #52 (✅ Closed)
- [FEATURE] Request multiple devices support — eunomia-bpf/bpftime #497 (✅ Closed)
- [FEATURE] Request BPF HashMap support on GPU — eunomia-bpf/bpftime #496 (✅ Closed)
- [FEATURE] local MCP capture. — eunomia-bpf/agentsight #18 (✅ Closed)
- [FEATURE] Support for Windows and macOS — eunomia-bpf/agentsight #17 (✅ Closed)
- [BUG] Regex in ptxpass fails to match floating-point and .b128 types — eunomia-bpf/bpftime #491 (✅ Closed)
- [BUG] - Agentsight deployed via Docker container not picking up network logs — eunomia-bpf/agentsight #16 (✅ Closed)
- Question - Browser agents — eunomia-bpf/agentsight #15 (✅ Closed)
- Unable to map CUDA IPC memory for shared array map, error=201 — eunomia-bpf/bpftime #488 (✅ Closed)
- Weekly Org Report (2025-10-20..2025-10-26) — eunomia-bpf/eunomia.dev #51 (✅ Closed)
- [FEATURE] Add CI for running GPU related examples — eunomia-bpf/bpftime #487 (✅ Closed)
- Monthly Org Report (2025-10-01..2025-10-31) — eunomia-bpf/eunomia.dev #50 (✅ Closed)
- [BUG] BPF map consistency between host and GPU devices with multiple thread blocks — eunomia-bpf/bpftime #486 (✅ Closed)
Org Activity Addendum (eunomia-bpf, 2025-11-01..2025-11-30)
New Stars
- Total new stars: 286
By Repository:
- eunomia-bpf/eunomia-bpf: 13 new stars
- eunomia-bpf/wasm-bpf: 7 new stars
- eunomia-bpf/bpf-developer-tutorial: 91 new stars
- eunomia-bpf/GPTtrace: 5 new stars
- eunomia-bpf/libbpf-rs-starter-template: 2 new stars
- eunomia-bpf/cilium-ebpf-starter-template: 2 new stars
- eunomia-bpf/awesome-ebpf-zh: 4 new stars
- eunomia-bpf/KEN: 1 new stars
- eunomia-bpf/kernel-bpf-library: 1 new stars
- eunomia-bpf/bpftime: 59 new stars
- eunomia-bpf/eunomia.dev: 16 new stars
- eunomia-bpf/GPTtrace-web: 1 new stars
- eunomia-bpf/inline-hook-demo: 1 new stars
- eunomia-bpf/eGPU: 9 new stars
- eunomia-bpf/llvmbpf: 9 new stars
- eunomia-bpf/code-survey: 1 new stars
- eunomia-bpf/basic-cuda-tutorial: 5 new stars
- eunomia-bpf/cupti-tutorial: 3 new stars
- eunomia-bpf/nvbit-tutorial: 1 new stars
- eunomia-bpf/xpu-perf: 9 new stars
- eunomia-bpf/schedcp: 22 new stars
- eunomia-bpf/agentsight: 17 new stars
- eunomia-bpf/MCPtrace: 5 new stars
- eunomia-bpf/gpu_ext_policy: 2 new stars
New Repositories
- eunomia-bpf/bam — created 2025-11-07T21:35:30Z
- eunomia-bpf/opentelemetry-ebpf-profiler — created 2025-11-10T08:20:35Z
- eunomia-bpf/UVM_benchmark — created 2025-11-11T05:47:18Z
- eunomia-bpf/xsched — created 2025-11-17T05:00:09Z
- eunomia-bpf/faiss — created 2025-11-18T20:04:27Z
- eunomia-bpf/llama.cpp — created 2025-11-19T01:57:05Z
- eunomia-bpf/gpu_ext_policy — created 2025-11-20T06:16:17Z
- eunomia-bpf/vllm — created 2025-11-26T05:25:17Z
Pull Requests Opened
- Fix issue #505 — eunomia-bpf/bpftime #520
- Implement gpu-kernel shared map — eunomia-bpf/bpftime #518
- Feat/per thread timing dist — eunomia-bpf/bpftime #517
- [+] Feat: Add kprobe entry support for CUDA kernel tracing — eunomia-bpf/bpftime #515
- [~] Fix error of launchlate example — eunomia-bpf/bpftime #514
- Fix LLVM Triple API for newer LLVM versions — eunomia-bpf/llvmbpf #39
- docs: add openSUSE installation command for bpftrace — eunomia-bpf/MCPtrace #2
- Fix incorrect include paths in schedcp-cli — eunomia-bpf/schedcp #6
- [+] Fix: fix ptxpass error that fails to match floating-point — eunomia-bpf/bpftime #513
- Add a simple example for faiss — eunomia-bpf/bpftime #512
- [Feature]: Add cutlass support — eunomia-bpf/bpftime #511
- [~]Fix: Change dynamic library loading to avoid glibc error — eunomia-bpf/bpftime #510
- Add option for LLVM preload workaround in CMake and update JIT context — eunomia-bpf/llvmbpf #38
- Update for "Reduce the cost of cudaHostRegister to improve performance" — eunomia-bpf/llvmbpf #37
- fix the issue where the eBPF verifier rejects the probe_SSL_write_ex_exit — eunomia-bpf/agentsight #21
- Make tutorials build on a DGX Spark w/ CUDA 13.0 — eunomia-bpf/cupti-tutorial #3
- [+]Feat: Reduce the cost of cudaHostRegister to improve performance — eunomia-bpf/bpftime #507
- Fix GPU-CPU memory synchronization and illegal memory access on NVIDI… — eunomia-bpf/bpftime #503
- impl builtin GPU hashmap — eunomia-bpf/bpftime #502
- [feature] Support for pytorch on GPU — eunomia-bpf/bpftime #501
- Remove `#ifndef` around `__libc_start_main` — eunomia-bpf/bpftime #499
- Fix incorrect include path — eunomia-bpf/schedcp #5
- Fixed wrong redirection — eunomia-bpf/eunomia.dev #53
- Enhance error logging in CUDA kernel launch and improve build configu… — eunomia-bpf/bpftime #498
- fix(32-wallclock-profiler): use absolute path to find tool paths — eunomia-bpf/bpf-developer-tutorial #186
- [+] Feat: Add cuda_memcpy_to_symbol for fixing constant sync issue — eunomia-bpf/bpftime #495
- Add gzip processing logic in alayzer — eunomia-bpf/agentsight #19
- fix: update test-ptxpass.yml for proxy — eunomia-bpf/bpftime #494
- fix: generate build-time header for PTX pass paths to avoid shell escaping issues — eunomia-bpf/bpftime #493
- Fix scheduler test script to use relative paths — eunomia-bpf/schedcp #4
- Fix: fix SIGSEGV from server log — eunomia-bpf/bpftime #492
- [CI] Add gpu examples CI tests — eunomia-bpf/bpftime #490
- [Bug Fix] Fix SHM open_type judgement — eunomia-bpf/bpftime #489
Pull Requests Merged
- [+] Feat: Add kprobe entry support for CUDA kernel tracing — eunomia-bpf/bpftime #515
- [~] Fix error of launchlate example — eunomia-bpf/bpftime #514
- Fix incorrect include paths in schedcp-cli — eunomia-bpf/schedcp #6
- [+] Fix: fix ptxpass error that fails to match floating-point — eunomia-bpf/bpftime #513
- Add a simple example for faiss — eunomia-bpf/bpftime #512
- [Feature]: Add cutlass support — eunomia-bpf/bpftime #511
- [~]Fix: Change dynamic library loading to avoid glibc error — eunomia-bpf/bpftime #510
- Add option for LLVM preload workaround in CMake and update JIT context — eunomia-bpf/llvmbpf #38
- Update for "Reduce the cost of cudaHostRegister to improve performance" — eunomia-bpf/llvmbpf #37
- fix the issue where the eBPF verifier rejects the probe_SSL_write_ex_exit — eunomia-bpf/agentsight #21
- [+]Feat: Reduce the cost of cudaHostRegister to improve performance — eunomia-bpf/bpftime #507
- [feature] Support for pytorch on GPU — eunomia-bpf/bpftime #501
- Remove `#ifndef` around `__libc_start_main` — eunomia-bpf/bpftime #499
- Fix incorrect include path — eunomia-bpf/schedcp #5
- Enhance error logging in CUDA kernel launch and improve build configu… — eunomia-bpf/bpftime #498
- fix(32-wallclock-profiler): use absolute path to find tool paths — eunomia-bpf/bpf-developer-tutorial #186
- [+] Feat: Add cuda_memcpy_to_symbol for fixing constant sync issue — eunomia-bpf/bpftime #495
- fix: update test-ptxpass.yml for proxy — eunomia-bpf/bpftime #494
- fix: generate build-time header for PTX pass paths to avoid shell escaping issues — eunomia-bpf/bpftime #493
- Fix scheduler test script to use relative paths — eunomia-bpf/schedcp #4
- Fix: fix SIGSEGV from server log — eunomia-bpf/bpftime #492
- [Bug Fix] Fix SHM open_type judgement — eunomia-bpf/bpftime #489
- [+] CI: Add test for ptx_pass, fix AI generated code, tidy the code — eunomia-bpf/bpftime #485
- feat: support early exit (#459) — eunomia-bpf/bpftime #481
- Replace non-existent function with `verify_scheduler` — eunomia-bpf/schedcp #3
- Support patching multiple ptx in fatbin — eunomia-bpf/bpftime #478
Commits
- Add GPU and CPU performance profiling tools — eunomia-bpf/xpu-perf @8110fac (2025-11-02T16:37:09-08:00)
- Refactor code structure for improved readability and maintainability — eunomia-bpf/xpu-perf @4086f91 (2025-11-02T16:14:21-08:00)
- Add uprobe profiler implementation with simple reporting — eunomia-bpf/xpu-perf @cd15acd (2025-11-02T14:12:45-08:00)
- fix: Correct context value usage in SimpleReporter and update test program to pass context_id — eunomia-bpf/xpu-perf @a7a3ce5 (2025-11-10T17:35:51Z)
- Refactor SimpleReporter to support custom trace events and remove unused symbol resolution code — eunomia-bpf/xpu-perf @dd9a897 (2025-11-10T17:14:59Z)
- feat: Implement eBPF uprobes support and add simple reporter for trace events — eunomia-bpf/xpu-perf @b7cbfd9 (2025-11-10T16:38:20Z)
- Add CUPTI PC Sampling example application — eunomia-bpf/xpu-perf @d37c8d2 (2025-11-10T07:30:09Z)
- Refactor code structure for improved readability and maintainability — eunomia-bpf/xpu-perf @e873d48 (2025-11-06T09:25:10-08:00)
- Add eBPF support for capturing CUPTI correlation IDs — eunomia-bpf/xpu-perf @7668dc0 (2025-11-10T20:03:00Z)
- feat: Update PyTorch training script to use ResNet-18 model and include loss calculation — eunomia-bpf/xpu-perf @1dce803 (2025-11-11T20:54:33-08:00)
- feat: Implement CUDA kernel and correlation ID uprobes for enhanced profiling — eunomia-bpf/xpu-perf @9f68304 (2025-11-11T22:04:54Z)
- feat: Enhance Makefile to include test application builds and improve testing process — eunomia-bpf/xpu-perf @cac887b (2025-11-16T14:30:50-08:00)
- feat: Add eBPF Go bindings generation to Makefile — eunomia-bpf/xpu-perf @b7582b0 (2025-11-16T14:25:45-08:00)
- Remove vectoradd flamegraph SVG and folded stack files — eunomia-bpf/xpu-perf @aefc551 (2025-11-16T16:55:05-08:00)
- Add script to run all NVIDIA bpftrace tests and log results — eunomia-bpf/xpu-perf @3b6ada7 (2025-11-16T20:43:29-08:00)
- feat: Add untested NVIDIA function tests and update test runner — eunomia-bpf/xpu-perf @fdd954e (2025-11-16T22:44:38-08:00)
- Enhance NVIDIA bpftrace test runner with monolithic support and logging improvements — eunomia-bpf/xpu-perf @9a698c4 (2025-11-16T22:34:14-08:00)
- feat: Update command-line interface to support subcommands for profiling modes — eunomia-bpf/xpu-perf @275f2f2 (2025-11-16T19:48:01-08:00)
- Enhance profiler README and correlation logic for improved clarity and accuracy — eunomia-bpf/xpu-perf @506973e (2025-11-03T01:53:25-08:00)
- feat: Enhance CPU/GPU correlation by supporting uprobes in merge mode and improving trace reporting — eunomia-bpf/xpu-perf @310f388 (2025-11-03T01:43:59-08:00)
- feat: Enhance CPU/GPU correlation with uprobes and add symbol resolution support — eunomia-bpf/xpu-perf @8506454 (2025-11-03T01:27:08-08:00)
- Refactor code structure for improved readability and maintainability — eunomia-bpf/xpu-perf @d338915 (2025-11-03T00:10:50-08:00)
- feat(profiler): add GPU+CPU performance profiler with CUPTI support — eunomia-bpf/xpu-perf @02e5131 (2025-11-03T00:02:20-08:00)
- Enhance CUDA Graph Tracking and eBPF Example — eunomia-bpf/xpu-perf @d965bbc (2025-11-02T17:51:26-08:00)
- feat: Add SimpleReporter for UProbe trace reporting — eunomia-bpf/xpu-perf @6b2ce3e (2025-11-02T17:38:26-08:00)
- Enhance minimal PyTorch script with additional tensor and neural network operations — eunomia-bpf/xpu-perf @3955975 (2025-11-11T21:40:30-08:00)
- fix: Improve symbol resolution for executable paths in stack trace extraction — eunomia-bpf/xpu-perf @8b26025 (2025-11-11T20:34:48-08:00)
- fix: Filter out uprobes without stack frames to improve correlation accuracy — eunomia-bpf/xpu-perf @4745b15 (2025-11-11T18:54:24-08:00)
- feat: Update Go version to 1.21 in CI workflows and refactor Makefile for improved build process — eunomia-bpf/xpu-perf @1012d7b (2025-11-03T21:02:40-08:00)
- feat: Add BPF trace script for chunk to VA block mapping — eunomia-bpf/xpu-perf @64547dc (2025-11-23T23:37:54-08:00)
- Add scripts for analyzing BPF chunk behavior and tracing hooks — eunomia-bpf/xpu-perf @87bde3d (2025-11-23T22:55:32-08:00)
- feat: Update README for GPU profiling command and add RAG2 demo script — eunomia-bpf/xpu-perf @965894a (2025-11-24T11:59:10-08:00)
- feat: Add bpftrace scripts for prefetch hint generation and thrashing detection — eunomia-bpf/xpu-perf @f26acbb (2025-11-19T22:42:35-08:00)
- feat: Add bpftrace script for tracing uvm_va_block_select_residency function with detailed metrics — eunomia-bpf/xpu-perf @12ea28e (2025-11-19T14:44:33-08:00)
- docs: Add README entry for available filter functions in tracing — eunomia-bpf/xpu-perf @c6a3189 (2025-11-19T12:32:40-08:00)
- Add GPU File I/O Benchmark and GDS Support Tools — eunomia-bpf/co-processor-demo @0bdd7ba (2025-11-10T04:36:25Z)
- Add comprehensive scripts and documentation for QEMU VM setup with GPU passthrough — eunomia-bpf/co-processor-demo @8501624 (2025-11-10T17:40:54-08:00)
- Add combined metrics table for workload cases in temp_quick_analysis.py — eunomia-bpf/co-processor-demo @e487f3c (2025-11-10T16:55:30-08:00)
- Add CLC scheduling policies and benchmarking framework — eunomia-bpf/co-processor-demo @3eaf4ed (2025-11-05T18:16:49-08:00)
- Add Windows compatibility for memory mapping and time functions — eunomia-bpf/co-processor-demo @7fdad6f (2025-11-05T21:51:43-08:00)
- Add CLC Scheduling Policy Benchmark Results and new workload policies — eunomia-bpf/co-processor-demo @e9b11ce (2025-11-05T21:34:51-08:00)
- Enhance CLC scheduling policies and benchmarks: add workload scaling parameters, modify stealing logic to include current block ID, and update benchmark script for improved configurability. — eunomia-bpf/co-processor-demo @ecadc4b (2025-11-05T20:11:27-08:00)
- Refactor CLC scheduling policies: rename ThrottledPolicy to NeverStealPolicy and update related benchmark references. Enhance plot generation by dynamically extracting policy names for improved readability. — eunomia-bpf/co-processor-demo @f7024f6 (2025-11-05T19:13:14-08:00)
- Refactor CLC scheduling policies to remove unnecessary keep_going_after_success callbacks and update documentation for clarity. Modify benchmark script to change plot generation flag for improved usability. — eunomia-bpf/co-processor-demo @5b186ce (2025-11-05T18:40:55-08:00)
- Add QEMU development environment setup scripts and documentation — eunomia-bpf/co-processor-demo @4722c02 (2025-11-10T06:20:30Z)
- Update CUDA architecture and memory management in benchmarks — eunomia-bpf/co-processor-demo @49c1922 (2025-11-11T07:20:37Z)
- up — eunomia-bpf/co-processor-demo @6927622 (2025-11-11T07:03:00Z)
- Add UVM benchmark results and testing process documentation — eunomia-bpf/co-processor-demo @a7cf018 (2025-11-11T06:26:47Z)
- Add evaluation section for OSDI paper on Unified Virtual Memory — eunomia-bpf/co-processor-demo @0c96708 (2025-11-11T07:51:12Z)
- Add CLC Workload Analysis and Benchmarking Framework — eunomia-bpf/co-processor-demo @2503e52 (2025-11-11T05:29:54Z)
- Add Multi-Stream GPU Scheduler Benchmark with Makefile, README, and CUDA implementation — eunomia-bpf/co-processor-demo @22cf837 (2025-11-12T00:10:19-08:00)
- Add UVMBench: A comprehensive benchmark suite for researching Unified Virtual Memory in GPUs — eunomia-bpf/co-processor-demo @08d3dda (2025-11-11T23:18:33-08:00)
- Add support for RQ6: Load Imbalance and Fairness analysis — eunomia-bpf/co-processor-demo @254fc02 (2025-11-12T00:45:41-08:00)
- Add UVM Benchmark Testing Process documentation and update Makefiles for CUDA compatibility — eunomia-bpf/co-processor-demo @087845d (2025-11-11T11:45:29-08:00)
- Add comprehensive evaluation design summary for UVM benchmark study — eunomia-bpf/co-processor-demo @4dd4903 (2025-11-11T19:22:31Z)
- Add xsched submodule for scheduler integration — eunomia-bpf/co-processor-demo @506a919 (2025-11-12T08:21:18-08:00)
- Add Windows compatibility for memory management functions — eunomia-bpf/co-processor-demo @34fff2c (2025-11-11T11:22:58-08:00)
- refactor: Update .gitignore and enhance README; modify kernel parameters for improved performance and usability — eunomia-bpf/co-processor-demo @06765ee (2025-11-16T21:37:23-08:00)
- Refactor and update UVM benchmark suite — eunomia-bpf/co-processor-demo @2345f95 (2025-11-16T20:57:35-08:00)
- refactor: Add Makefile and .gitignore for GEMM CUDA test application setup — eunomia-bpf/co-processor-demo @d36b73f (2025-11-17T14:56:10-08:00)
- Refactor Makefile and remove unused wrapper kernel to streamline build process; update gemm_policy_wrapper.h and gemm_test.cu for improved kernel launching with optional policy framework. — eunomia-bpf/co-processor-demo @352ef7d (2025-11-18T00:37:19-08:00)
- Enhance configuration loading by allowing model directory specification in load_config and build_tokenizer functions — eunomia-bpf/co-processor-demo @9cce0ac (2025-11-17T22:59:46-08:00)
- Add CUPTI-based kernel interception and policy enforcement for cuBLAS — eunomia-bpf/co-processor-demo @2796888 (2025-11-17T22:45:21-08:00)
- Remove --no-device-link flag from Makefiles and documentation across various projects to align with updated CUDA compilation practices. This change enhances compatibility with newer CUDA versions while maintaining performance optimizations. The modifications include adjustments in Makefiles for GPU examples, memory benchmarks, and scheduler components, as well as updates to README files to reflect the removal of the flag in example commands and instructions. — eunomia-bpf/co-processor-demo @48367f2 (2025-11-17T22:40:01-08:00)
- Refactor code structure for improved readability and maintainability — eunomia-bpf/co-processor-demo @6e8b5d5 (2025-11-17T22:16:40-08:00)
- Refactor CLC benchmark by removing the policy comparison file and updating Makefile and .gitignore accordingly — eunomia-bpf/co-processor-demo @633c7f1 (2025-11-05T14:33:38-08:00)
- Add specialized scheduling policies and benchmarking framework — eunomia-bpf/co-processor-demo @04f8862 (2025-11-05T14:23:54-08:00)
- Refactor CLC scheduler policy interface to hold state in shared memory and enforce uniform control flow — eunomia-bpf/co-processor-demo @ca4f915 (2025-11-05T14:07:28-08:00)
- Refactor GPU scheduling policy framework and enhance file I/O benchmarks — eunomia-bpf/co-processor-demo @ec2ddbf (2025-11-05T12:17:50-08:00)
- Add CLC Benchmark and GPU File I/O examples — eunomia-bpf/co-processor-demo @22c82b1 (2025-11-05T11:35:19-08:00)
- Add Cluster Launch Control (CLC) examples and benchmarks for CUDA 12.9 — eunomia-bpf/co-processor-demo @1036638 (2025-11-04T21:40:37-08:00)
- Remove outdated CUTLASS test results, scripts, and submodule; add QLoRA fine-tuning scripts and requirements — eunomia-bpf/co-processor-demo @dbe1cd6 (2025-11-04T21:23:23-08:00)
- Add AI workload simulations and benchmarking scripts — eunomia-bpf/co-processor-demo @0db8b46 (2025-11-04T22:25:35-08:00)
- Refactor CLC Benchmark Workloads for AI Inference Scenarios — eunomia-bpf/co-processor-demo @c0d460a (2025-11-04T22:03:23-08:00)
- Add workload analysis benchmark and update Makefile for CLC — eunomia-bpf/co-processor-demo @79a0b6d (2025-11-04T21:45:34-08:00)
- Add comprehensive Cluster Launch Control (CLC) benchmark with configurable workloads — eunomia-bpf/co-processor-demo @6b43750 (2025-11-04T21:46:10-08:00)
- Enhance CLC scheduler policy interface with additional callbacks for improved control over work stealing behavior — eunomia-bpf/co-processor-demo @ea11910 (2025-11-05T00:58:43-08:00)
- Refactor scheduler policy interface to simplify callbacks and enhance control over work stealing behavior — eunomia-bpf/co-processor-demo @4a1237f (2025-11-05T00:49:41-08:00)
- Refactor CLC benchmark Makefile and enhance policy benchmark documentation — eunomia-bpf/co-processor-demo @c3aa4b1 (2025-11-05T00:29:48-08:00)
- Add CLC policy benchmark and scheduler policy interface implementation — eunomia-bpf/co-processor-demo @f3161c3 (2025-11-04T23:00:13-08:00)
- Add CLC performance analysis for real-world AI workloads — eunomia-bpf/co-processor-demo @cd47800 (2025-11-04T22:29:06-08:00)
- Implement cuBLAS dynamic linking with runtime policy enforcement — eunomia-bpf/co-processor-demo @91a356c (2025-11-17T18:43:33-08:00)
- refactor: Enhance runtime dynamic linking by modifying Makefile and updating kernel invocation in wrapper — eunomia-bpf/co-processor-demo @a70d7e4 (2025-11-17T17:16:06-08:00)
- refactor: Update Makefile for dynamic linking and remove gemm_kernel.cu — eunomia-bpf/co-processor-demo @3a67c8b (2025-11-17T17:02:49-08:00)
- refactor: Implement runtime dynamic linking for GEMM kernel and policy integration — eunomia-bpf/co-processor-demo @e9b2391 (2025-11-17T16:53:25-08:00)
- refactor: Add RQ4-RQ6 sections for UVM prefetching, thrashing detection, and access counter migration experiments — eunomia-bpf/co-processor-demo @44abbb0 (2025-11-17T16:36:17-08:00)
- fix: Update baseline size factor reference in UVM slowdown calculation — eunomia-bpf/co-processor-demo @a89a1b3 (2025-11-17T15:52:30-08:00)
- Add Windows compatibility for memory management functions — eunomia-bpf/co-processor-demo @d1ed31a (2025-11-17T22:12:41-08:00)
- refactor: Consolidate size factors for kernel benchmarks to ensure consistent runtime across configurations — eunomia-bpf/co-processor-demo @cd3defc (2025-11-16T23:47:12-08:00)
- refactor: Enhance pointer chase kernel with GPU-based node initialization and multi-segment design for oversubscription testing — eunomia-bpf/co-processor-demo @f1d380e (2025-11-16T23:27:02-08:00)
- refactor: Add defensive checks for iterations and update input/output byte calculations in synthetic kernel — eunomia-bpf/co-processor-demo @af3b2d1 (2025-11-16T23:01:28-08:00)
- Add UVM memory management strategies and benchmark results for various kernels — eunomia-bpf/co-processor-demo @c2a0319 (2025-11-19T00:54:55-08:00)
- Refactor GEMM example to use nvJitLink policy framework — eunomia-bpf/co-processor-demo @71c04bb (2025-11-18T13:53:20-08:00)
- Add GEMM kernel implementation and policy framework integration; update .gitignore to include new test files. — eunomia-bpf/co-processor-demo @b354e2d (2025-11-18T13:44:36-08:00)
- Enhance README and policy framework with device function pointers; introduce generic wrapper kernel template for simplified kernel integration with policies. — eunomia-bpf/co-processor-demo @ea48f93 (2025-11-18T13:11:03-08:00)
- Refactor Makefile and source files to streamline policy application; enhance policy function pointer handling and update kernel execution logic. — eunomia-bpf/co-processor-demo @01de14f (2025-11-18T12:51:36-08:00)
- Refactor Makefile and source files to implement policy wrappers for extracted kernels; update policy application logic and enhance PTX extraction process. — eunomia-bpf/co-processor-demo @702a4a4 (2025-11-18T12:44:47-08:00)
- Add nvJitLink policy framework demo with user kernel integration and runtime policy application — eunomia-bpf/co-processor-demo @33d16ad (2025-11-18T12:25:25-08:00)
- Refactor Makefile and gemm_test.cu to link GEMM kernel with policy at runtime; remove user_kernel.cu as it is no longer needed. — eunomia-bpf/co-processor-demo @273e088 (2025-11-18T12:24:30-08:00)
- Add binary extraction and JIT rewriting framework — eunomia-bpf/co-processor-demo @685d17b (2025-11-18T12:12:33-08:00)
- Add demos for dynamic loading, cuGetProcAddress, nvJitLink, nvFatbin, and context-independent loading; enhance README files with detailed usage instructions and examples for each feature. — eunomia-bpf/co-processor-demo @3a7ba10 (2025-11-18T11:46:55-08:00)
- Add scheduling policies and GEMM benchmark for CUDA — eunomia-bpf/co-processor-demo @75482df (2025-11-18T19:33:49-08:00)
- Add cuBLAS demo with Makefile, README, and .gitignore — eunomia-bpf/co-processor-demo @6e5b3cd (2025-11-21T22:35:59-08:00)
- Remove QLoRA fine-tuning files and dependencies from the repository — eunomia-bpf/co-processor-demo @4cd2e6b (2025-11-21T23:13:26-08:00)
- refactor: Enhance random access pattern in rand_chunk_kernel and update size factors for oversubscription characterization — eunomia-bpf/co-processor-demo @453d1c6 (2025-11-17T00:48:49-08:00)
- feat: Add GEMM kernel implementation with Makefile and README documentation — eunomia-bpf/co-processor-demo @6dc3cd3 (2025-11-17T20:19:04-08:00)
- Refactor code structure for improved readability and maintainability — eunomia-bpf/co-processor-demo @2ea8b02 (2025-11-15T14:22:35-08:00)
- tgg — eunomia-bpf/co-processor-demo @330f19b (2025-11-15T13:38:25-08:00)
- Enhance RQ4 and RQ5 modules for improved preemption latency measurement and analysis — eunomia-bpf/co-processor-demo @c59da6d (2025-11-15T14:53:10-08:00)
- refactor: Enhance RQ6 module to include 'pattern' column in CSV output and improve heterogeneous type handling — eunomia-bpf/co-processor-demo @0543b62 (2025-11-15T22:12:55-08:00)
- Add inline policy support for CLC scheduler with new policies and update Makefile — eunomia-bpf/co-processor-demo @9bdcb65 (2025-11-18T18:23:44-08:00)
- Add NoStealPolicy implementation and update Makefile and benchmark driver — eunomia-bpf/co-processor-demo @a0f72db (2025-11-18T17:40:56-08:00)
- Add CLC Policy Framework for Qwen3 CUDA Inference — eunomia-bpf/co-processor-demo @382b454 (2025-11-18T15:13:22-08:00)
- Add GEMM Benchmark System with comprehensive documentation and visualization — eunomia-bpf/co-processor-demo @49ca80e (2025-11-18T14:46:21-08:00)
- ds — eunomia-bpf/co-processor-demo @54266fa (2025-11-18T14:39:38-08:00)
- Add policy framework and GEMM kernel with policy support — eunomia-bpf/co-processor-demo @b640ce1 (2025-11-18T14:26:07-08:00)
- Add policy framework and GEMM kernel with nvJitLink support — eunomia-bpf/co-processor-demo @6457cae (2025-11-18T14:12:26-08:00)
- Add workload submodule for llama.cpp and enhance visualization scripts for dynamic subplot creation — eunomia-bpf/co-processor-demo @a1ec5e8 (2025-11-18T19:04:59-08:00)
- Add debug trace option to benchmark for detailed kernel execution analysis — eunomia-bpf/co-processor-demo @8ca6d2c (2025-11-14T00:46:47-08:00)
- Enhance RQ3 analysis to include end-to-end latency calculations and update metrics output for priority classes — eunomia-bpf/co-processor-demo @10edd17 (2025-11-14T00:17:58-08:00)
- Add multi-stream scheduler benchmark with metrics and common utilities — eunomia-bpf/co-processor-demo @257a47a (2025-11-13T22:38:53-08:00)
- Refactor and remove RQ4-RQ7 analyzers and experiment driver code — eunomia-bpf/co-processor-demo @c0d7856 (2025-11-14T16:22:16-08:00)
- Refactor multi-stream analysis and metrics handling — eunomia-bpf/co-processor-demo @dae4c88 (2025-11-14T16:17:20-08:00)
- Refactor benchmark configuration by removing debug trace option and update metrics computation to include end-to-end latency and service time metrics. — eunomia-bpf/co-processor-demo @8cecc5c (2025-11-14T16:08:29-08:00)
- Add CSV output option and random seed control to benchmark configuration — eunomia-bpf/co-processor-demo @43131cb (2025-11-14T15:44:33-08:00)
- Refactor multi-stream benchmark for modular metrics computation and enhance priority handling — eunomia-bpf/co-processor-demo @401b2a9 (2025-11-14T15:32:36-08:00)
- fix: Update SVM Makefiles and source files for CUDA compatibility; change C++ standard to C++17 and include necessary headers — eunomia-bpf/co-processor-demo @6a31700 (2025-11-14T23:08:57-08:00)
- feat: Enhance RQ4-RQ8 analyzers with improved configuration detection, validation, and pattern classification; add new metrics and visualizations — eunomia-bpf/co-processor-demo @75f2a78 (2025-11-14T22:28:55-08:00)
- feat: Enhance RQ1-RQ3 and RQ4-RQ8 analyzers with improved load calculations and filtering; add new metrics output — eunomia-bpf/co-processor-demo @ffab558 (2025-11-14T20:29:51-08:00)
- refactor: Clean up CSV output handling in ExperimentDriver and update plotting logic in RQ1-RQ3 and RQ4-RQ8 analyzers — eunomia-bpf/co-processor-demo @16bb6f8 (2025-11-14T19:40:55-08:00)
- feat: Implement experiment driver for GPU scheduler research — eunomia-bpf/co-processor-demo @abc425b (2025-11-14T17:56:49-08:00)
- Enhance RQ3 analysis to include detailed performance metrics and visualizations for CUDA priority mechanism, and update experiment driver to support detailed per-kernel output. — eunomia-bpf/co-processor-demo @d60a968 (2025-11-13T21:34:18-08:00)
- Enhance RQ2 workload characterization to support varying stream counts in experiment driver and analysis — eunomia-bpf/co-processor-demo @5c767cb (2025-11-13T21:08:44-08:00)
- Implement multi-threaded kernel launching and workload size variations for RQ1 analysis — eunomia-bpf/co-processor-demo @4291b39 (2025-11-13T17:34:57-08:00)
- Enhance RQ1 analysis and experiment driver to include workload size variations for stream scalability — eunomia-bpf/co-processor-demo @93c621e (2025-11-13T17:20:18-08:00)
- Enhance multi-stream analysis and experiment framework for heterogeneous workloads — eunomia-bpf/co-processor-demo @dd81291 (2025-11-13T16:32:18-08:00)
- refactor: Update RQ modules and figures for Multi-Process vs Single-Process analysis; rename sub-RQs and enhance CSV output with per-process metrics — eunomia-bpf/co-processor-demo @b5bd1f6 (2025-11-15T16:24:21-08:00)
- Add NVIDIA UVM Kernel Parameters Configuration Guide — eunomia-bpf/co-processor-demo @53ac820 (2025-11-19T22:13:21-08:00)
- Add gpu_ext_policy submodule for external GPU scheduling policies — eunomia-bpf/co-processor-demo @d3e8001 (2025-11-19T22:26:47-08:00)
- fix: update image links to use absolute URLs in README and SUMMARY templates — eunomia-bpf/bpf-developer-tutorial @01ee4ea (2025-11-03T23:16:14-08:00)
- docs: auto-generate documentation — eunomia-bpf/bpf-developer-tutorial @134ee88 (2025-11-04T07:16:27Z)
- feat: Add book recommendations and promotional content for eBPF learning resources — eunomia-bpf/bpf-developer-tutorial @285e4f2 (2025-11-03T21:39:29-08:00)
- feat: Add book recommendation for eBPF observability and introduce configuration for GPU flamegraph — eunomia-bpf/bpf-developer-tutorial @75ec466 (2025-11-03T22:13:59-08:00)
- docs: auto-generate documentation — eunomia-bpf/bpf-developer-tutorial @81be963 (2025-11-04T06:14:12Z)
- docs: auto-generate documentation — eunomia-bpf/bpf-developer-tutorial @b9b82fa (2025-11-04T05:39:43Z)
- feat: Update TOC generation functions to include output file directory parameter and adjust lesson paths in SUMMARY files — eunomia-bpf/bpf-developer-tutorial @fd03b86 (2025-11-12T07:39:14-08:00)
- docs: auto-generate documentation — eunomia-bpf/bpf-developer-tutorial @a1430cf (2025-11-12T15:39:28Z)
- feat: Add initial implementation of struct_ops BPF program and kernel module with kfunc support — eunomia-bpf/bpf-developer-tutorial @2c9870b (2025-11-08T15:51:36-08:00)
- fix: Remove bpf_printk calls from struct_ops BPF programs to comply with restrictions — eunomia-bpf/bpf-developer-tutorial @540ce67 (2025-11-15T22:24:51Z)
- feat: Update struct_ops to change test_3 return type to int and add BPF helper support — eunomia-bpf/bpf-developer-tutorial @86c2f72 (2025-11-15T23:54:20Z)
- feat: Enhance test_3 to safely read from kernel buffer and return length — eunomia-bpf/bpf-developer-tutorial @8372e9b (2025-11-16T06:19:28Z)
- fix(32-wallclock-profiler): use absolute path to find tool paths (#186) — eunomia-bpf/bpf-developer-tutorial @ca92713 (2025-11-09T10:20:35-08:00)
- docs: auto-generate documentation — eunomia-bpf/bpf-developer-tutorial @78bad82 (2025-11-10T18:31:03Z)
- feat: Implement BPF struct_ops example with custom kernel module and user-space loader — eunomia-bpf/bpf-developer-tutorial @5ce10be (2025-11-10T18:30:48Z)
- Add option for LLVM preload workaround in CMake and update JIT context (#38) — eunomia-bpf/llvmbpf @9ea0180 (2025-11-23T17:36:19+08:00)
- Update for buffer size & spin_lock setting (#37) — eunomia-bpf/llvmbpf @e54f5aa (2025-11-22T22:13:16-08:00)
- Refactor CUDA library detection and linking in CMakeLists.txt — eunomia-bpf/llvmbpf @f6c531d (2025-11-22T22:15:45-08:00)
- Add Spirv support (#35) — eunomia-bpf/llvmbpf @b8e4eea (2025-11-22T22:15:45-08:00)
- Update README and example PTX documentation to clarify GPU architecture detection and compilation options. Remove references to automatic compute capability detection for PTX generation, and note that the example code is hardcoded for a specific GPU architecture. Enhance clarity on build instructions and output examples. — eunomia-bpf/llvmbpf @4e36518 (2025-11-22T22:15:15-08:00)
- Published post and updated queue [skip ci] — eunomia-bpf/eunomia.dev @2f07894 (2025-11-18T07:12:50Z)
- Published post and updated queue [skip ci] — eunomia-bpf/eunomia.dev @549f69a (2025-11-25T07:13:41Z)
- Revise ideas.md with new project proposals and structure — eunomia-bpf/eunomia.dev @13249f1 (2025-11-17T12:02:30-08:00)
- Document APX-aware JIT backend project — eunomia-bpf/eunomia.dev @1b31ac5 (2025-11-17T11:46:32-06:00)
- Published post and updated queue [skip ci] — eunomia-bpf/eunomia.dev @749ccc5 (2025-11-11T07:12:53Z)
- Update for modify contents — eunomia-bpf/eunomia.dev @ddb7696 (2025-11-10T18:36:07Z)
- Update for modify contents — eunomia-bpf/eunomia.dev @ffb3e35 (2025-11-09T18:25:58Z)
- Revise GPU observability challenges section to enhance clarity and detail on existing tools' limitations and the need for eBPF integration — eunomia-bpf/eunomia.dev @81f795e (2025-11-06T13:25:08-08:00)
- Refine GPU observability challenges section by expanding on existing tools' limitations and introducing a third category for fine-grained analysis tools — eunomia-bpf/eunomia.dev @8636011 (2025-11-06T13:21:50-08:00)
- Enhance GPU observability discussion by addressing production challenges and introducing continuous profiling solutions with eBPF integration — eunomia-bpf/eunomia.dev @b3c3e04 (2025-11-06T13:21:50-08:00)
- Add GPU observability challenges document and enhance related presentations — eunomia-bpf/eunomia.dev @ec94a5a (2025-11-06T13:21:50-08:00)
- Update for modify contents — eunomia-bpf/eunomia.dev @3c21089 (2025-11-08T23:57:03Z)
- Add teaser for upcoming performance benchmarks in GPU observability architecture section — eunomia-bpf/eunomia.dev @971ed55 (2025-11-06T13:26:52-08:00)
- Update for modify contents — eunomia-bpf/eunomia.dev @4f55bb9 (2025-11-04T05:44:46Z)
- Published post and updated queue [skip ci] — eunomia-bpf/eunomia.dev @c0930ab (2025-11-04T07:12:28Z)
- Update for modify contents — eunomia-bpf/eunomia.dev @c8e1cc3 (2025-11-04T06:19:09Z)
- Update for modify contents — eunomia-bpf/eunomia.dev @f0df412 (2025-11-04T07:21:30Z)
- Update for modify contents — eunomia-bpf/eunomia.dev @4874e0a (2025-11-12T15:44:46Z)
- Refactor driver architecture description to emphasize modular design and OS-agnostic components, enhancing clarity on pre-compiled binaries and their role in cross-kernel compatibility. — eunomia-bpf/eunomia.dev @6459ca9 (2025-11-12T07:53:42-08:00)
- Update for modify contents — eunomia-bpf/eunomia.dev @d114d3b (2025-11-16T06:24:56Z)
- Update for modify contents — eunomia-bpf/eunomia.dev @867ff6d (2025-11-15T22:30:14Z)
- Update for modify contents — eunomia-bpf/eunomia.dev @117d561 (2025-11-15T23:59:49Z)
- Add OpenAI Proxy POCs with Reverse and MITM Proxy Implementations — eunomia-bpf/agentsight @f2ad7e2 (2025-11-03T00:41:22-08:00)
- Fix the issue where the eBPF verifier rejects the probe_SSL_write_ex_exit (#21) — eunomia-bpf/agentsight @7d9de42 (2025-11-20T21:42:59-08:00)
- feat: Implement HTTP decompressor for gzip/deflate encoded responses; add tests and example server script — eunomia-bpf/agentsight @fe422e9 (2025-11-06T21:21:31-08:00)
- Add prefetch_always_max and prefetch_none BPF programs with necessary updates — eunomia-bpf/gpu_ext_policy @26e1420 (2025-11-23T01:41:37-08:00)
- Refactor code structure for improved readability and maintainability — eunomia-bpf/gpu_ext_policy @6f2d812 (2025-11-23T01:34:48-08:00)
- Move kfunc test for substring search from test_1 to test_3 — eunomia-bpf/gpu_ext_policy @c38b456 (2025-11-23T00:32:55-08:00)
- Add always_max BPF program and cleanup tool for struct_ops instances — eunomia-bpf/gpu_ext_policy @24b699a (2025-11-23T00:42:52-08:00)
- Refactor struct_ops to implement uvm_bpf_test_trigger_kfunc and update cleanup logic for struct_ops map — eunomia-bpf/gpu_ext_policy @50d8e86 (2025-11-23T00:39:29-08:00)
- Refactor code structure for improved readability and maintainability — eunomia-bpf/gpu_ext_policy @4fa280a (2025-11-23T00:58:52-08:00)
- Refactor struct_ops/test_1 to include kfunc test for substring search — eunomia-bpf/gpu_ext_policy @aa81dbe (2025-11-22T18:59:32-08:00)
- Enhance adaptive prefetch policy with NVML integration for PCIe throughput monitoring and update threshold dynamically — eunomia-bpf/gpu_ext_policy @41b5b75 (2025-11-23T11:53:46-08:00)
- Add adaptive threshold prefetch policy implementation and update Makefile — eunomia-bpf/gpu_ext_policy @49d6dcb (2025-11-23T11:14:11-08:00)
- Define uvm_gpu_chunk_struct for BPF access to chunk list field — eunomia-bpf/gpu_ext_policy @6834871 (2025-11-23T17:36:11-08:00)
- Add FIFO eviction policy implementation for GPU memory management — eunomia-bpf/gpu_ext_policy @1f2847e (2025-11-23T17:14:03-08:00)
- Add new function pointer for chunk activation in uvm_gpu_ext struct — eunomia-bpf/gpu_ext_policy @69bacbb (2025-11-24T02:54:12-08:00)
- Remove LFU eviction policy section from README.md — eunomia-bpf/gpu_ext_policy @8b9441a (2025-11-24T03:02:11-08:00)
- Update README.md to clarify chunk list structure and eviction process — eunomia-bpf/gpu_ext_policy @6e38654 (2025-11-24T02:59:42-08:00)
- Add scripts for analyzing and visualizing chunk trace data — eunomia-bpf/gpu_ext_policy @25e4306 (2025-11-24T01:44:09-08:00)
- Refactor FIFO eviction policy by removing chunk populate hook and updating struct_ops — eunomia-bpf/gpu_ext_policy @6778f82 (2025-11-24T01:07:45-08:00)
- Refactor chunk trace statistics and remove depopulate hook from FIFO eviction policy — eunomia-bpf/gpu_ext_policy @4c68e49 (2025-11-24T00:08:49-08:00)
- Add scripts for GPU chunk trace analysis and VA block mapping — eunomia-bpf/gpu_ext_policy @8843457 (2025-11-23T23:38:01-08:00)
- Add chunk trace analysis script and enhance BPF hook tracing with VA block information — eunomia-bpf/gpu_ext_policy @628e199 (2025-11-23T23:18:28-08:00)
- Add chunk trace tool for BPF hook call tracing and update Makefile — eunomia...[Issue body truncated]