Dataset contents

This Zenodo record publishes a set of GEMC-produced HIPO files containing simulated detector response (and associated Monte Carlo truth where available) intended for:

- Track reconstruction development and validation (algorithm development, regression testing, performance studies)
- Data-preparation workflows for ML / foundation-model experiments

Primary data format: HIPO
Primary generator/simulation toolchain: GEMC (Geant4-based), with configuration and job recipes tracked in the linked repository.

How the data were produced (provenance)

The HIPO files were generated by running GEMC with a controlled detector and physics configuration, producing simulated detector hits / digitization and exporting to HIPO for downstream reconstruction and analysis. Generation and configuration details are documented in:

Software repository: https://code.jlab.org/cmorean/gemc-trackreco

At minimum, the production procedure includes:

- Selection of geometry and detector configuration (TODO)
- Event generation (TODO)
- Transport and detector response simulation (Geant4 via GEMC)
- Digitization / hit formatting and export to HIPO

Reproducibility note: This record is intended to be reproducible from the repository, pinned via commit hashes/tags and/or container images when available (TODO).

File organization

The deposition contains:

- *.hipo: simulated events in HIPO format

Intended use

This dataset is designed to support:

- Reconstruction algorithm development (including track finding/fitting, detector alignment studies, and performance comparisons)
- Controlled ML experiments where ground-truth labels and stable simulation conditions are required
- Data “readiness” studies for large-scale QCD-related ML pipelines, including provenance-aware dataset curation and repeatable training/evaluation splits

It is not intended to represent a final physics result; it is a simulation-derived dataset intended for method development and evaluation under a known configuration.
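As an illustration of the provenance-aware curation and repeatable training/evaluation splits mentioned under intended use, the following is a minimal sketch rather than part of the record or its repository. It uses only the Python standard library and does not read HIPO content: it treats each `*.hipo` file as an opaque blob, records a SHA-256 checksum, and assigns a split by hashing the file name. The function names (`sha256_of`, `assign_split`, `build_manifest`), the manifest fields, and the 20% evaluation fraction are all assumptions made for this example.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Stream a file through SHA-256 so large files need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def assign_split(file_name: str, eval_fraction: float = 0.2) -> str:
    """Deterministically map a file name to 'train' or 'eval'.

    Hashing the name (instead of random shuffling) keeps the assignment
    stable across machines and runs, and unchanged when files are added.
    The 0.2 default is an illustrative assumption, not a recommendation.
    """
    name_hash = hashlib.sha256(file_name.encode("utf-8")).hexdigest()
    bucket = int(name_hash, 16) % 10_000
    return "eval" if bucket < eval_fraction * 10_000 else "train"


def build_manifest(data_dir: Path, eval_fraction: float = 0.2) -> list[dict]:
    """Collect name, size, checksum, and split for every *.hipo file."""
    return [
        {
            "file": path.name,
            "bytes": path.stat().st_size,
            "sha256": sha256_of(path),
            "split": assign_split(path.name, eval_fraction),
        }
        for path in sorted(data_dir.glob("*.hipo"))
    ]
```

Calling `build_manifest(Path("."))` in a directory holding the downloaded files returns a list that can be serialized with `json.dumps` and stored next to the data. Because the split depends only on the file name, two users who download the same deposition obtain the same partition without exchanging a random seed, and the checksums allow each to confirm that the files match.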