Add `ql:has-word` triples to internal PSO&POS permutation #2579

hannahbast · 2025-12-06T01:44:49Z

During parsing, for each triples with a literal object, and for each word in that literal, add an internal triple subject ql:has-word "word". These can be used for highly customized text search. To make this efficient, materialized views can be used.

TODO: This is currently done unconditionally, which makes it easier to test (we don't need special options in the Qleverfile). Eventually, there should be an option --add-has-word-triples to IndexBuilderMain to enable this behavior. Tests are also still missing

During parsing, for each triples with a literal object, and for each word in that literal, add an internal triple `subject ql:has-word "word"`. TODO: This is currently done unconditionally, which makes it easier to test (we don't need special options in the Qleverfile). Eventually, there should be an option `--add-has-word-triples` to `IndexBuilderMain` to enable this behavior. Tests are also still missing

codecov · 2025-12-06T18:58:39Z

Codecov Report

❌ Patch coverage is 98.18182% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 91.20%. Comparing base (50d08bc) to head (6dcd451).

Files with missing lines	Patch %	Lines
src/index/IndexBuilderTypes.h	97.43%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #2579   +/-   ##
=======================================
  Coverage   91.20%   91.20%           
=======================================
  Files         473      473           
  Lines       40233    40264   +31     
  Branches     5378     5386    +8     
=======================================
+ Hits        36695    36724   +29     
- Misses       2006     2007    +1     
- Partials     1532     1533    +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Writing the position is more general. But computing the term frequencies for each text-word pair is currently not efficient in QLever (it requires too much memory and a GROUP BY with two variables is much slower than a GROUP BY with one variable). Since we never needed positions so far, but we do want term frequencies for scoring, let's make this the default for now.

This complements ad-freiburg/qlever#2579

sparql-conformance · 2025-12-07T02:16:28Z

Overview

Number of Tests	Passed ✅	Intended ✅	Failed ❌	Not tested
525	379	67	79	0

Conformance check passed ✅

No test result changes.

Details: https://qlever.dev/sparql-conformance-ui?cur=6dcd451a4878273ffb816ba5255d33023376610c&prev=50d08bc03a4c1fe7495d4209a3542c90ce36b997

sonarqubecloud · 2025-12-07T03:13:35Z

Quality Gate passed

Issues
6 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

Hannah Bast added 5 commits December 6, 2025 02:41

The subject of the ql:has-word triple should be the literal

f0389da

Add option --add-has-word-triples to IndexBuilderMain

5113120

Fix failing unit tests

75cf0c7

Add word positions (as graph) and log number of triples added

961377d

hannahbast mentioned this pull request Dec 7, 2025

Add option HAS_WORD_TRIPLES for qlever index command qlever-dev/qlever-control#221

Merged

hannahbast added a commit to qlever-dev/qlever-control that referenced this pull request Dec 7, 2025

Add option HAS_WORD_TRIPLES for qlever index command (#221)

ad083e0

This complements ad-freiburg/qlever#2579

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add `ql:has-word` triples to internal PSO&POS permutation #2579

Add `ql:has-word` triples to internal PSO&POS permutation #2579

Uh oh!

hannahbast commented Dec 6, 2025 •

edited

Loading

Uh oh!

codecov bot commented Dec 6, 2025 •

edited

Loading

Uh oh!

sparql-conformance bot commented Dec 7, 2025

Uh oh!

sonarqubecloud bot commented Dec 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add ql:has-word triples to internal PSO&POS permutation #2579

Are you sure you want to change the base?

Add ql:has-word triples to internal PSO&POS permutation #2579

Uh oh!

Conversation

hannahbast commented Dec 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Dec 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

sparql-conformance bot commented Dec 7, 2025

Overview

Conformance check passed ✅

Uh oh!

sonarqubecloud bot commented Dec 7, 2025

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add `ql:has-word` triples to internal PSO&POS permutation #2579

Add `ql:has-word` triples to internal PSO&POS permutation #2579

hannahbast commented Dec 6, 2025 •

edited

Loading

codecov bot commented Dec 6, 2025 •

edited

Loading