feat: add device id #779

michaelfeil · 2025-12-18T00:36:28Z

What does this PR do?

This adds a option to the CLI to select the device ID.

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines.
Did you write any new necessary tests? If applicable, did you include or update the insta snapshots?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

OlivierDehaene · 2025-12-19T12:47:48Z

Why not use the CUDA_VISIBLE_DEVICES env var?

michaelfeil · 2025-12-19T18:15:53Z

@OlivierDehaene good to see you. I actually want to add py03 bindings for the backend, and wiring that though would help. Env vars can't change, so its more about adding this capability into backend.

alvarobartt

Thank you @michaelfeil, I've left some comments!

P.S. Agree with @OlivierDehaene that using CUDA_VISIBLE_DEVICES would be ideal, but given that this might help with your use-case it's probably the same with other users, so I'm happy to move forward, unless Olivier thinks otherwise, I don't have a strong opinion against this change 🤗

alvarobartt · 2025-12-23T17:24:22Z

router/src/main.rs

    #[clap(long, env)]
    dense_path: Option<String>,

+    /// The device ID to use for CUDA/Metal devices. Defaults to 0.


Suggested change

/// The device ID to use for CUDA/Metal devices. Defaults to 0.

/// The CUDA device ID where the model will be loaded. Defaults to 0 i.e., the first available device.

alvarobartt · 2025-12-23T17:25:39Z

backends/candle/src/lib.rs

            Ok(Device::Cpu)
        } else if candle::utils::metal_is_available() {
-            Device::new_metal(0)
+            Device::new_metal(device_id)


AFAIK there are no instances where you have more than one M-chip on MacOS, right?

Suggested change

Device::new_metal(device_id)

Device::new_metal(0)

alvarobartt · 2025-12-23T17:27:02Z

backends/candle/src/lib.rs

            #[cfg(feature = "cuda")]
            match compatible_compute_cap() {
-                Ok(true) => Device::new_cuda(0),
+                Ok(true) => Device::new_cuda(device_id),


IMO we should try to add some sort of validation here, to ensure that the device_id is within the bounds of the available devices in the given instance, or is the candle error enough?

michaelfeil added 3 commits December 18, 2025 00:35

add device id

964c22b

add device id

24ab9aa

another test

e04dd63

alvarobartt reviewed Dec 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add device id #779

feat: add device id #779

Uh oh!

michaelfeil commented Dec 18, 2025

Uh oh!

OlivierDehaene commented Dec 19, 2025

Uh oh!

michaelfeil commented Dec 19, 2025

Uh oh!

alvarobartt left a comment •

edited

Loading

Uh oh!

alvarobartt Dec 23, 2025

Uh oh!

alvarobartt Dec 23, 2025

Uh oh!

alvarobartt Dec 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	/// The device ID to use for CUDA/Metal devices. Defaults to 0.
	/// The CUDA device ID where the model will be loaded. Defaults to 0 i.e., the first available device.

feat: add device id #779

Are you sure you want to change the base?

feat: add device id #779

Uh oh!

Conversation

michaelfeil commented Dec 18, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

OlivierDehaene commented Dec 19, 2025

Uh oh!

michaelfeil commented Dec 19, 2025

Uh oh!

alvarobartt left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alvarobartt Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

alvarobartt Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

alvarobartt Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

alvarobartt left a comment •

edited

Loading