[codex] Batch selected OpenNeuro downloads #283
Merged
✅MegaLinter analysis: Success
Update after the adaptive downloader fix (
Conclusion: this PR recovers the ds004330 engineering path through download/stage/decode, but the small-batch scientific signal is too weak to justify claiming ds004330 as a second positive OpenNeuro dataset or spending full-cohort compute right now. I would merge/keep the downloader hardening if useful, then pause ds004330 until the ds006629 response-window result is packaged.

Summary
Attempts to recover the ds004330 OpenNeuro engineering path by replacing the per-include openneuro-py loop with a bounded-batch downloader.

The previous ds004330 full sharded run failed before staging/decoding: each shard attempted hundreds of separate openneuro-py download --include ... invocations, and one representative job aborted with double free or corruption (fasttop) / exit code 134 during raw FIF download. This patch keeps the same exact include selection, but passes repeated --include arguments in bounded batches, lowers OpenNeuro download concurrency, writes the include manifest, and retries after failed batches by re-checking which files remain missing.

Validation
PYTHONPATH=src python -m pytest tests\test_openneuro_meg.py tests\test_openneuro_ds004330_workflow.py tests\test_openneuro_resilient.py
PYTHONPATH=src python -m py_compile src\neureptrace\openneuro_meg.py
.github/workflows/openneuro-meg-loso.yml as YAML
git diff --check

Smoke Run
I dispatched a ds004330 smoke run from this branch to verify the recovered path gets past the old download failure:
https://github.com/IPS-Stuttgart/NeuRepTrace/actions/runs/27114344429
This smoke run is engineering validation only (subjects=1,2, runs=01,02,03, real labels). It is not the full paper-level ds004330 real-vs-shuffle result.
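
For reviewers who want the shape of the bounded-batch idea from the Summary without opening the diff, here is a minimal sketch. It is not the code in src/neureptrace/openneuro_meg.py: the function names, the default batch_size and max_rounds, and the injected run callable are all hypothetical, and the --dataset and --target-dir flags are assumed from the openneuro-py CLI. Only the repeated --include arguments and the re-check of missing files are taken from the description above.

```python
from pathlib import Path
from typing import Callable, Iterator, List, Sequence


def chunked(items: Sequence[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of at most `size` items."""
    if size < 1:
        raise ValueError("size must be >= 1")
    for start in range(0, len(items), size):
        yield list(items[start:start + size])


def build_command(dataset: str, target_dir: Path, includes: Sequence[str]) -> List[str]:
    """Build one CLI call carrying a whole batch of --include arguments.

    Assumption: --dataset and --target-dir are the relevant openneuro-py flags.
    """
    argv = ["openneuro-py", "download", "--dataset", dataset,
            "--target-dir", str(target_dir)]
    for path in includes:
        argv += ["--include", path]
    return argv


def missing_files(target_dir: Path, includes: Sequence[str]) -> List[str]:
    """Return the include paths that do not yet exist under target_dir."""
    return [p for p in includes if not (target_dir / p).is_file()]


def download_in_batches(
    dataset: str,
    target_dir: Path,
    includes: Sequence[str],
    run: Callable[[List[str]], int],  # hypothetical runner returning an exit code
    batch_size: int = 16,             # illustrative default, not from the PR
    max_rounds: int = 3,              # illustrative default, not from the PR
) -> List[str]:
    """Download in bounded batches; return whatever is still missing."""
    remaining = missing_files(target_dir, includes)
    for _ in range(max_rounds):
        if not remaining:
            break
        for batch in chunked(remaining, batch_size):
            # The exit code is deliberately ignored: a failed batch is
            # picked up again by the re-check of missing files below.
            run(build_command(dataset, target_dir, batch))
        remaining = missing_files(target_dir, includes)
    return remaining
```

In real use the runner could be something like lambda argv: subprocess.run(argv).returncode. Injecting it keeps the retry logic testable without network access, and re-checking the file system after each round means a crashed batch (for example the exit code 134 abort) costs only a retry of the files it failed to deliver.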