
[Feature Request] Add a small "document-to-KG conversion failure modes" note (docs only) #43

@onestardao

Hi AutoSchemaKG team,

Thank you for releasing AutoSchemaKG. Automatically converting documents into schemas and knowledge graphs is a powerful upstream component for many RAG systems.

I have been working on failure-mode maps for RAG and data pipelines and recently contributed a robustness-related entry to Harvard MIMS Lab’s ToolUniverse. In document-to-KG workflows, I often see recurring problems:

  • important entities or relations are dropped during conversion
  • schema choices introduce hidden biases in what can be represented
  • downstream RAG systems fail in ways that are hard to trace back to KG conversion errors

I would like to propose a small, documentation-only failure-mode note for AutoSchemaKG.

Proposed feature

Add a short Markdown page, named for example:

doc_to_kg_conversion_failure_modes.md

The page could:

  1. List typical failure modes when converting documents to KGs:
    • missing entities or relations
    • incorrect grounding
    • schema gaps
  2. For each, describe:
    • symptoms users might see
    • likely causes (parsing, mapping, schema design)
    • basic diagnostics or sanity checks
  3. Provide a short checklist for users who plan to feed the resulting KG into RAG systems.
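
To make the "basic diagnostics or sanity checks" item concrete, here is a minimal sketch of the kind of check the page could describe. It does not use any AutoSchemaKG API: the `Triple` class, the `check_kg` function, and the sample data are all hypothetical, and the KG is assumed to be available as plain (head, relation, tail) triples. String matching is a deliberately crude stand-in for real entity linking.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Triple:
    """Hypothetical plain representation of one KG edge."""
    head: str
    relation: str
    tail: str


def check_kg(source_text, triples, expected_entities, allowed_relations):
    """Run one crude check per failure mode and return a report dict."""
    text = source_text.lower()
    kg_entities = {t.head.lower() for t in triples} | {t.tail.lower() for t in triples}

    # Missing entities: expected from the document but absent from the KG.
    missing = sorted(e for e in expected_entities if e.lower() not in kg_entities)

    # Incorrect grounding: head or tail never appears in the source text.
    ungrounded = [
        t for t in triples
        if t.head.lower() not in text or t.tail.lower() not in text
    ]

    # Schema gaps: extracted relations that the chosen schema does not cover.
    off_schema = sorted({t.relation for t in triples} - set(allowed_relations))

    return {
        "missing_entities": missing,
        "ungrounded_triples": ungrounded,
        "off_schema_relations": off_schema,
    }


# Illustrative data only; not taken from any real AutoSchemaKG run.
text = "Marie Curie discovered polonium in 1898. She worked in Paris."
triples = [
    Triple("Marie Curie", "discovered", "polonium"),
    Triple("Marie Curie", "born_in", "Warsaw"),
]
report = check_kg(
    text,
    triples,
    expected_entities=["Marie Curie", "polonium", "Paris"],
    allowed_relations={"discovered", "worked_in"},
)
print(report)
```

In this toy run, "Paris" is reported as a missing entity, the `born_in` triple is flagged as ungrounded because "Warsaw" never appears in the text, and `born_in` is reported as outside the schema. A real check would likely need alias handling and fuzzy matching, but even this level of reporting helps trace a downstream RAG error back to the conversion step.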

Motivation

  • AutoSchemaKG is an upstream component for retrieval and reasoning systems.
  • A small failure-mode note would help users interpret downstream errors and design better schemas and checks.
  • This is a docs-only change and can be gradually refined.

If this seems useful, I would be happy to draft a concise version in a PR.

Thank you for considering this.
