Replies: 3 comments
-
|
+1 |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
+1 (binding) |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
+1 binding |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This is the voting thread for adding the FM-Index scalar index type to the Lance format specification. See PR #7026 for the implementation.
What is FM-Index?
FM-Index is a compressed full-text index based on the Burrows-Wheeler Transform, following the Infini-gram Mini paper. It supports exact substring matching with index size ~0.44-0.95x of the text corpus.
Key features:
Format Changes
FMIndexIndexDetailsmessage toprotos/index.proto(empty message; all metadata stored in Lance file headers)FMIndex = 11variant toIndexTypeenumBenchmark (100K gitlake source code files, 1.59 GB text)
Voting Instructions
Please vote by commenting:
Beta Was this translation helpful? Give feedback.
All reactions