For rebuttal, add UIE model support in CV evaluation and corresponding argument#100
Pull request overview
Adds PaddleNLP UIE (Taskflow) as an additional extraction backend for the CV parsing evaluator, selectable via a new CLI flag, to support rebuttal-focused CV evaluation runs.
Changes:
- Introduces `UIEEvaluator`, which runs PaddleNLP Taskflow `information_extraction` over `CV_FIELDS`.
- Adds a `--use_uie` CLI flag and wires evaluator selection to prefer UIE when enabled.
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| `src/gimbench/cv/evaluators.py` | Adds `UIEEvaluator` and updates `conduct_eval` to select it via `args.use_uie`. |
| `src/gimbench/arguments.py` | Adds `--use_uie` flag to CV evaluation arguments. |
```python
except Exception as e:
    logger.error(f"PaddleNLP UIE generation failed: {e}")
    extraction = dict.fromkeys(CV_FIELDS, "")
return extraction
```
`UIEEvaluator._extract_fields` catches all exceptions and returns an all-empty extraction. As a result, failures never propagate to `_evaluate_item`, so `error_msg` stays empty and the run counts these items as normal (incorrect) rather than as errors (nor are they excluded by `_filter_non_error_items`). Consider letting the exception propagate (or re-raising after logging) so extraction failures are tracked consistently with the other evaluators.
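The suggested log-then-re-raise pattern could look roughly like this (a minimal sketch, not the PR's code; `extract_fields` and the `backend_call` parameter are hypothetical stand-ins for the evaluator's UIE call):

```python
import logging

logger = logging.getLogger(__name__)

def extract_fields(backend_call):
    """Run the UIE backend; log failures but let them propagate.

    Re-raising means the caller (in the PR, _evaluate_item) can record
    error_msg for the item, so it is counted as an error rather than
    silently scored as an incorrect extraction.
    """
    try:
        return backend_call()
    except Exception:
        logger.exception("PaddleNLP UIE generation failed")
        raise  # propagate so the failure is tracked like other evaluators
```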
```python
except Exception as e:
    logger.error(f"PaddleNLP UIE generation failed: {e}")
    extraction = dict.fromkeys(CV_FIELDS, "")
```
This error path logs with `logger.error(...)`, which drops the stack trace. If you keep handling exceptions here, prefer `logger.exception(...)` (or pass `exc_info=True`) so UIE failures remain debuggable from the logs.
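A small self-contained demo of the difference (logger name and message invented for illustration): `logger.exception(...)` logs at ERROR level and automatically appends the active traceback, equivalent to `logger.error(..., exc_info=True)`.

```python
import io
import logging

# Capture log output in a buffer so we can inspect it.
logger = logging.getLogger("uie_demo")
logger.setLevel(logging.ERROR)
buf = io.StringIO()
logger.addHandler(logging.StreamHandler(buf))

try:
    raise RuntimeError("UIE backend unavailable")
except Exception as e:
    # Unlike logger.error(...), this records the full traceback too.
    logger.exception("PaddleNLP UIE generation failed: %s", e)

# buf now contains both the message and the "Traceback ..." section.
```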
```python
def _add_cv_eval_args(parser):
    parser.add_argument("--use_outlines", action="store_true", help="Whether to use outlines in CV evaluation")
    parser.add_argument("--use_uie", action="store_true", help="Whether to use Traditional UIE model in CV evaluation")
    parser.add_argument(
```
`--use_outlines` and `--use_uie` both select the CV extractor, yet they can be enabled together. Since `conduct_eval` silently prioritizes UIE when both are set, it would be clearer to make the flags mutually exclusive (via an argparse mutually-exclusive group) or to validate and error out when both are provided.
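The mutually-exclusive-group variant could be sketched like this (flag names from the PR; the parser wiring is simplified for illustration):

```python
import argparse

parser = argparse.ArgumentParser()
# Both flags pick the CV extractor, so argparse can reject combining them.
group = parser.add_mutually_exclusive_group()
group.add_argument("--use_outlines", action="store_true", help="Whether to use outlines in CV evaluation")
group.add_argument("--use_uie", action="store_true", help="Whether to use Traditional UIE model in CV evaluation")

args = parser.parse_args(["--use_uie"])  # OK: only one extractor selected
# parser.parse_args(["--use_uie", "--use_outlines"]) would exit with an
# "not allowed with argument" error instead of silently preferring UIE.
```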
```python
def conduct_eval(args: Namespace, ds: Dataset):
    if hasattr(args, "use_uie") and args.use_uie:
        evaluator = UIEEvaluator(args, ds)
    else:
        evaluator = OutlinesEvaluator(args, ds) if args.use_outlines else GIMEvaluator(args, ds)
```
If both `use_uie` and `use_outlines` are set, UIE wins due to this conditional, but that precedence is never communicated to the user. Either enforce mutual exclusivity during argument parsing/validation, or emit a clear warning or error here to avoid surprising evaluator selection.
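If the flags are kept independent, the warning path could look like this sketch (`select_evaluator` is a hypothetical helper, and the evaluator classes are replaced with string labels purely for illustration; the real code constructs `UIEEvaluator`/`OutlinesEvaluator`/`GIMEvaluator`):

```python
import logging
from argparse import Namespace

logger = logging.getLogger(__name__)

def select_evaluator(args: Namespace) -> str:
    """Pick the CV extractor, warning when flag precedence kicks in."""
    if getattr(args, "use_uie", False):
        if getattr(args, "use_outlines", False):
            # Make the silent precedence explicit to the user.
            logger.warning("--use_uie and --use_outlines both set; UIE takes precedence")
        return "UIEEvaluator"
    return "OutlinesEvaluator" if args.use_outlines else "GIMEvaluator"
```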