Notes on the Data Challenge
Latest revision as of 17:13, 24 February 2017
Ideas
- Curtis's note (https://halldweb.jlab.org/doc-private/DocDB/ShowDocument?docid=2031)
  - Comprehensive
  - See note
- Mark's blog
  - develop system for submitting and tracking large-volume simulation and reconstruction jobs
- Matt's email
  - include data analysis system
  - develop/test system for delivering reconstructed data to data analyzers
- David's email
  - two data challenges: simulation and reconstruction
    - simulation: run the MC, reconstruct it, produce reconstructed data
    - reconstruction: create fake raw data sample, reconstruct it
  - shipping reconstructed data to two institutions
  - other specific proposals
Tools
- EventStore
- PanDA
  - received offer of help from Torre Wenaus
Intermediate Goal: Mini Data Challenges (reconstruction-type)
- one major problem to be solved is how to scale:
  - how to generate and run thousands of jobs
  - how to assess their status (before, during, and after they run)
  - how to manage all output files and diagnostic data
  - the same issues apply to simulation and reconstruction: we want a common framework
- with this in place we can iterate in mini data challenges:
  - wrong data mix? change it
  - wrong output format? change it
  - wrong photon reconstruction algorithm? change it
  - we want to be in a position where re-running a mini-challenge is no big deal
- in parallel develop everything else:
  - code correctness
  - execution speed
  - design and implement analysis system
  - raw data generation
  - planning for test bed for full data challenge
  - reconstructed data format
- find bottlenecks at intermediate scale
  - say by September 1, and every two weeks after that
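The scaling problem above (generate thousands of jobs, assess their status before, during, and after they run, and keep track of all output) is essentially a bookkeeping problem. A toy sketch of such a tracking layer, assuming a simple SQLite table; the table schema, status names, and function names are all invented here for illustration and do not correspond to any existing GlueX tool:

```python
# Toy job-tracking bookkeeping layer (all names hypothetical): each job
# gets a row recording its kind, configuration, and lifecycle status, so
# thousands of jobs can be generated, submitted, and audited uniformly.
import sqlite3

def init_db(conn):
    conn.execute("""CREATE TABLE IF NOT EXISTS jobs (
        job_id INTEGER PRIMARY KEY,
        kind   TEXT,   -- 'simulation' or 'reconstruction'
        config TEXT,   -- e.g. path to the job's configuration file
        status TEXT    -- 'created', 'submitted', 'running', 'done', 'failed'
    )""")

def create_jobs(conn, kind, configs):
    # one row per job, all starting in the 'created' state
    conn.executemany(
        "INSERT INTO jobs (kind, config, status) VALUES (?, ?, 'created')",
        [(kind, c) for c in configs])

def set_status(conn, job_id, status):
    conn.execute("UPDATE jobs SET status = ? WHERE job_id = ?",
                 (status, job_id))

def summary(conn):
    # counts of jobs in each state, for a quick status assessment
    return dict(conn.execute(
        "SELECT status, COUNT(*) FROM jobs GROUP BY status"))

if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    init_db(conn)
    create_jobs(conn, "reconstruction",
                [f"run_{i:05d}.cfg" for i in range(1000)])
    set_status(conn, 1, "submitted")
    print(summary(conn))  # counts by status, e.g. {'created': 999, 'submitted': 1}
```

Because the same table serves simulation and reconstruction jobs, this is one concrete shape the "common framework" could take; a real system would add submission hooks for the batch farm and paths to output/diagnostic files.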
Analysis System
- another major problem: we don't have one
- more of a design and development effort
- can be fed by data from mini-challenges
- event format
- storage requirements/configuration
- data discovery
- user tools
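One way to picture the data-discovery item: analyzers locate reconstructed data through a catalog keyed by metadata rather than by hard-coded file paths, which is the role EventStore-style tools (cited under Tools) play. A toy sketch of the idea; the class name, keys, and file paths below are all invented for illustration:

```python
# Toy dataset-discovery catalog (all names hypothetical): files are
# registered under (data type, software version) keys, and analyzers
# query by those keys instead of knowing the storage layout.
from collections import defaultdict

class Catalog:
    def __init__(self):
        self._files = defaultdict(list)

    def register(self, data_type, version, path):
        # record a file under its metadata key
        self._files[(data_type, version)].append(path)

    def find(self, data_type, version):
        # return all files matching the key (empty list if none)
        return list(self._files[(data_type, version)])

cat = Catalog()
cat.register("rest", "recon-2012-1", "/cache/rest/run_001.evio")
cat.register("rest", "recon-2012-1", "/cache/rest/run_002.evio")
print(cat.find("rest", "recon-2012-1"))
# ['/cache/rest/run_001.evio', '/cache/rest/run_002.evio']
```

The event format and storage-configuration questions above determine what the keys and values of such a catalog would actually be; a production version would persist the mapping and expose it through the user tools.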
References
- PanDA proposal for system development for non-Atlas projects
- ATLAS data challenge note
- EventStore