Contributor

@kevin-montrose kevin-montrose commented Aug 14, 2025

Implement a small subset of Vector Set functionality, as a base for future full support.

This PR:

  • Uses DiskANN to do the actual vector operations
    • While not available yet, latest DiskANN (and the diskann-garnet integration package) should be OSS'd soon
    • The expectation is, even if reviewed and approved, this PR will not be merged until diskann-garnet is available in nuget.org (and ideally the source is also available on GitHub)
  • Introduces a notion of namespaces to Tsavorite, which is used to "hide" data from other commands
  • Adds a VectorSet bit to RecordInfo to allow distinguishing Vector Sets from other keys
  • Implements a subset of VADD, VREM, VEMB, and VSIM
  • Adds two extensions to these Redis commands:
    • XB8 - which allows passing byte values without conversion, joining FP32 (which is used for little-endian floats)
    • XPREQ8 - a quantization option that takes pre-quantized vectors, meant for use with XB8
  • Recovery, replication, and migration for Vector Sets
  • Hides all this functionality behind EnableVectorSetPreview/--enable-vector-set-preview

A more complete write up of the design, justifications, and remaining work is in vector-set.md.


There is still a lot of work to be done on Vector Sets, but this PR is at a point where it's functional enough to be played with - and there's merit to merging it so other work (Store V2, multi-threaded replication, etc.) isn't complicated.

The "big" things (besides commands and options) that are missing are:

  • Non-XPREQ8 quantizers - implementation here is on DiskANN
  • Variable length vector ids - likewise, support is coming in DiskANN, though some Garnet work will also be needed

…interface for now.

Mostly interested in validating splaying things out into the MainStore, and the FFI since our likely integrations are non-C#.
…de with main store keys'; uses magic bit patterns in trailing byte
Collaborator

badrishc commented Aug 18, 2025

This all makes sense. In storage-v2, we have the notion of a LogRecord record format, that already includes optional fields such as ETag and Expiration. Adding a namespace would be analogous to this, with a couple of differences:

  • We use one RecordInfo bit to indicate whether the record has a namespace field
  • If true, then the NameSpace byte (or larger if needed) exists as part of the record, roughly: <RecordInfo, key, value, etag?, expiration?, namespace?, ...>
  • Tsavorite APIs are adjusted to accept the optional namespace (in addition to key/expiration/etag).
  • All hash codes and key equalities in Tsavorite would need to incorporate the namespace in their computations.
  • Certain namespaces (e.g., namespace numbers starting with "11") would be reserved for vector-set usage. Within that sub-namespace, the vector-sets can partition bits as they want.

TBD: how to handle larger namespace names in the same framework (e.g., "/user/foo/"), and is that useful/necessary to make as a first-class citizen versus users directly incorporating in the actual key.
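The layout and hashing changes described above can be sketched roughly as follows. This is a hypothetical illustration, not Tsavorite's actual types or API; the names and the header size are made up for clarity.

```csharp
using System;

// Hypothetical sketch of a record with optional trailing fields flagged in the
// header, plus a key hash that folds in the namespace so the same key bytes
// under two different namespaces hash (and compare) as distinct keys.
static class NamespacedRecordSketch
{
    // <RecordInfo, key, value, etag?, expiration?, namespace?>
    public static int RecordSize(int keyLen, int valueLen, bool hasEtag, bool hasExpiration, bool hasNamespace)
        => sizeof(long)                        // RecordInfo header word (illustrative)
         + keyLen + valueLen
         + (hasEtag ? sizeof(long) : 0)        // optional ETag
         + (hasExpiration ? sizeof(long) : 0)  // optional Expiration
         + (hasNamespace ? sizeof(byte) : 0);  // optional Namespace byte

    // Namespace participates in the hash, analogous to incorporating it into
    // Tsavorite's hash codes and key-equality checks.
    public static ulong HashWithNamespace(ReadOnlySpan<byte> key, byte ns)
    {
        ulong h = 14695981039346656037UL;      // FNV-1a offset basis
        foreach (var b in key) { h ^= b; h *= 1099511628211UL; }
        h ^= ns; h *= 1099511628211UL;         // fold the namespace in last
        return h;
    }
}
```

With this shape, a lookup for key `k` in namespace 0 can never land on the record for `k` in namespace 1, since both the hash and the equality check include the namespace.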

Contributor

prvyk commented Aug 26, 2025

Adding a namespace would be analogous to this, with a couple of differences:

...

TBD: how to handle larger namespace names in the same framework (e.g., "/user/foo/"), and is that useful/necessary to make as a first-class citizen versus users directly incorporating in the actual key.

Namespaces can be used for a lot of things, like replacing the current numbered-database implementation with something supported cluster-wide as in valkey (in that case they'd end up in the same AOF file, I guess?). Also, quite a few RESP-accepting DBs have a namespace implementation of sorts, e.g. kvrocks. I had an idea of combining ACLs and database numbers (IMHO, this would usually be better than redis's prefix ACLs, because it doesn't require the client to cooperate), and namespaces would be just as natural here.

…because of shared locking context, removing Tsavorite locks made the bug possible
#region RMW
/// <inheritdoc />
public int GetRMWInitialValueLength(ref VectorInput input)
=> sizeof(byte) + sizeof(int) + input.WriteDesiredSize;

@tiagonapoli tiagonapoli Dec 4, 2025


I didn't follow the additional sizeof(byte) + sizeof(int) here - why is it added?

Contributor Author


It's namespace + ttl + actual data.

The TTL bits are currently unused but were kept because a) it meshes better with how MainSessionFunctions works, so some helpers assume it's there, and b) we thought TTL might turn out to be necessary.

In store v2 this should go away, or at least be "the same" as everything else, since TTL and namespaces are first-class there.
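The layout this reply describes (namespace byte, then TTL, then the actual data) can be sketched to show where `sizeof(byte) + sizeof(int)` comes from. The names here are illustrative, not Garnet's actual code; `writeDesiredSize` stands in for `input.WriteDesiredSize`.

```csharp
using System;

// Illustrative sketch of the value layout implied above:
// [ namespace (1 byte) ][ ttl (4 bytes, currently unused) ][ payload ]
static class VectorValueLayout
{
    // Mirrors sizeof(byte) + sizeof(int) + input.WriteDesiredSize
    public static int InitialValueLength(int writeDesiredSize)
        => sizeof(byte) + sizeof(int) + writeDesiredSize;

    public static void Write(Span<byte> dst, byte ns, int ttl, ReadOnlySpan<byte> payload)
    {
        dst[0] = ns;                                          // namespace byte
        BitConverter.TryWriteBytes(dst.Slice(1, sizeof(int)), ttl); // unused TTL slot
        payload.CopyTo(dst.Slice(1 + sizeof(int)));           // actual vector data
    }
}
```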

Contributor Author


Actually, thinking about this longer... I dunno that the byte makes sense at all - that may just be left over from earlier exploration and I conflated it with the namespace byte added later. I'll poke at it.

/// <inheritdoc />
public bool SingleReader(ref SpanByte key, ref VectorInput input, ref SpanByte value, ref SpanByte dst, ref ReadInfo readInfo)
{
Debug.Assert(key.MetadataSize == 1, "Should never read a non-namespaced value with VectorSessionFunctions");


For my learning: just confirming - this also applies to all other functions, right? RMW, Write, etc. are always namespaced for VectorSessionFunctions?

Contributor Author


Correct, the assert should be duplicated onto appropriate paths - will fix.


ref var ctx = ref ActiveThreadSession.vectorContext;

tryAgain:


Just curious - why the choice for goto?

Contributor Author


I was probably futzing about with an interlocked version at some point and kept the goto. It is a smell here, lemme replace that with a while(true).
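The replacement being promised here is mechanical. A minimal compare-and-swap retry loop of the shape the comment describes might look like this; it is a standalone sketch, not Garnet's actual code.

```csharp
using System;
using System.Threading;

// A `while (true)` retry loop replacing the `tryAgain:` label + `goto` pattern:
// read, attempt a CAS, and loop only if another thread raced us.
static class RetryLoopSketch
{
    static int slot;

    public static void SetIfGreater(int candidate)
    {
        while (true)                        // replaces `tryAgain: ... goto tryAgain;`
        {
            var observed = Volatile.Read(ref slot);
            if (candidate <= observed)
                return;                     // nothing to do
            if (Interlocked.CompareExchange(ref slot, candidate, observed) == observed)
                return;                     // CAS won; otherwise retry the loop
        }
    }

    public static int Current => Volatile.Read(ref slot);
}
```

The control flow is identical to the `goto` version, but the loop makes the retry structure explicit to the reader.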

}

// Actually delete the value
var del = storageSession.basicContext.Delete(ref key);

@tiagonapoli tiagonapoli Dec 4, 2025


For my learning: what happens if a crash occurs in the middle of TryDeleteVectorSet - will that VectorSet be corrupted? Or will Garnet be able to recover to a valid state from the AOF?

For example, what happens if we delete a vector set, but fail to set ContextMetadata with the cleanUp flag because a restart happens first - is it possible that all DiskANN data will remain there and not cleaned up?

Maybe I'm missing something, but I was wondering whether we should first RMW ContextMetadata with the cleanup flag, to ensure that even upon restart we'll know that a particular context needs cleanup.

Contributor Author


That's a good catch - we will leave the Vector Set in a corrupted state if a crash happens between the RMW completing and the CleanupDroppedIndex completing.

The fix is a little trickier - just moving the ContextMetadata marking isn't enough. While that would let us resume a cleanup of element data after crashing (since incomplete cleanups are enqueued at process start), we wouldn't be able to find the index or replication keys.

The RMW is what starts the delete process, so resuming the delete on first touch is probably the path forward. Needs a good test, so I'm gonna get that in place before fixing.
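The ordering under discussion is a classic write-ahead pattern: durably record intent before destroying anything, and re-run any still-pending cleanups on restart. A toy in-memory sketch (all names hypothetical; the real version would persist these marks durably):

```csharp
using System;
using System.Collections.Generic;

// Toy sketch of the crash-safe drop ordering discussed above:
// 1) durably record "cleanup pending" (the RMW on ContextMetadata),
// 2) delete the index/element data,
// 3) clear the pending mark only after cleanup completes.
// On startup (or first touch), anything still pending is cleaned up again.
class DropIndexSketch
{
    public readonly HashSet<int> PendingCleanup = new();  // stand-in for ContextMetadata flags
    public readonly HashSet<int> IndexData = new();       // stand-in for index state

    public void Drop(int contextId)
    {
        PendingCleanup.Add(contextId);    // step 1: intent first, so a crash here is recoverable
        IndexData.Remove(contextId);      // step 2: actual destruction
        PendingCleanup.Remove(contextId); // step 3: done
    }

    // Run at process start (or on first touch of the context).
    public void ResumeIncomplete()
    {
        foreach (var ctx in PendingCleanup)
            IndexData.Remove(ctx);        // idempotent: safe to redo a partial cleanup
        PendingCleanup.Clear();
    }
}
```

The key property is that cleanup is idempotent, so a crash between any two steps just means the work is redone on recovery.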

}

// Helper to complete reads/writes when a vector set synthetic op goes async
static void CompletePending(ref Status status, ref SpanByteAndMemory output, ref TContext context)


nit: looks like this function is repeated 3 times in this file; should we have a common method?

Contributor Author


Consequence of these all being in different files and then consolidated into VectorManager over time. Will DRY up.

keyWithNamespace.SetNamespaceInPayload(0);
key.AsReadOnlySpan().CopyTo(keyWithNamespace.AsSpan());

Span<byte> dummyBytes = stackalloc byte[4];


nit: looks like it's unused

this.storeWrapper = storeWrapper;

var replayAofStoreWrapper = new StoreWrapper(storeWrapper, recordToAof);
replayAofStoreWrapper = new StoreWrapper(storeWrapper, recordToAof);
Collaborator


Maybe leave this as a local, to prevent people from using it for any other purpose.

{
// Allocate session outside of task so we fail "nicely" if something goes wrong with acquiring them
var allocatedSession = obtainServerSession();
if (allocatedSession.activeDbId != self.dbId && !allocatedSession.TrySwitchActiveDatabaseSession(self.dbId))
Collaborator


Currently cluster supports a single database; we should validate that VectorManager.dbId == 0.

throw new GarnetException($"Could not switch replication replay session to {self.dbId}, replication will fail");
}

self.replicationReplayTasks[i] = Task.Factory.StartNew(
Collaborator


I might have missed it, but we need a way to cancel these tasks in the event the replication stream breaks (i.e. failover).
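One common shape for this is to thread a CancellationToken into each replay loop so a broken stream can tear the tasks down cooperatively. A hedged sketch, not Garnet's actual replication code (class and member names are invented):

```csharp
using System;
using System.Threading;
using System.Threading.Tasks;

// Hypothetical sketch: a replay task that observes a CancellationToken so it
// can be stopped when the replication stream breaks (e.g., failover).
class ReplayTasksSketch
{
    readonly CancellationTokenSource cts = new();
    Task replay;

    public void Start()
    {
        replay = Task.Factory.StartNew(() =>
        {
            while (!cts.Token.IsCancellationRequested)
            {
                // ... replay one AOF record ...
                Thread.Sleep(1); // stand-in for real work
            }
        }, cts.Token, TaskCreationOptions.LongRunning, TaskScheduler.Default);
    }

    // Called on failover / stream break.
    public void Stop()
    {
        cts.Cancel();
        try { replay.Wait(); }                 // loop exits at the next token check
        catch (AggregateException) { }         // task was canceled before it ran
    }

    public bool Completed => replay != null && replay.IsCompleted;
}
```

Passing the token to `StartNew` also lets a task that never got scheduled transition straight to the Canceled state instead of running at all.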
