Skip to content

Processing Dashboard

This dashboard exposes the working state of the archive. It is generated from the processed catalogs and then summarized for readers here. The counts below describe custody and review state; they do not mean every OCR-derived candidate is correct.

The generated Source Text Browser now exposes 304 processed text sections across the corpus. These reader pages are pagefind-disabled so broad OCR/PDF text coverage does not swamp curated search results.

The generated Chapter Workbench now exposes 304 section-level research maps across the corpus, joining source text links with theme snippets, concept/glossary hits, candidate equations, candidate figures, quote candidates, and promotion checklists.

The generated Concept Concordance now traces 77 curated terms and concepts across all 304 processed sections, linking each hit back to the generated source text and chapter workbench.

The generated Completion Audit now measures source-by-source readiness for canonical review. The expert finishing standard is recorded in World-Class Completion Criteria.

The generated Citation And Data Export page publishes a public data manifest, reusable JSON exports, BibTeX, CSL JSON, and citation guidance. These exports keep review-state fields visible so external reuse does not flatten candidates into verified claims.

The next scholarly controls are also public: Notation Ledger, Diagram Provenance Ledger, Schema Reference, Expert Review Packets, Release Levels, Accessibility Audit, Edition Comparison Layer, Patent To Theory Bridge, Canonical Verification Workbench with equation, figure, and patent scan-check queues, and the Claim Attribution Ledger for source-isolated fact/candidate/translation/interpretation layers.

SourceChaptersEquation CandidatesFigure CandidatesOriginal CropsReview State
Radiation, Light and Illumination13300985First canonical source with scan crops, OCR, lecture splits, concepts, equations, and diagram anchors.
Elementary Lectures on Electric Discharges, Waves and Impulses10300160Candidate lecture map for discharge, wave, impulse, and electric-field language.
Engineering Mathematics630090Mathematical support source for notation, series, exponentials, complex quantities, and empirical curves.
Theory and Calculation of Alternating Current Phenomena373001454Main AC corpus for symbolic method, impedance, reactance, admittance, harmonics, and transformers.
Theory and Calculation of Transient Electric Phenomena and Oscillations5830016Major transient corpus with first original crops and condenser-response redraw support.
Theoretical Elements of Electrical Engineering114300100Source-specific section map for magnetism, E.M.F., induction, field language, apparatus, motors, and transformers.
General Lectures on Electrical Engineering17134140Ordinal lecture parser seeded power-system, harmonics, surge, lightning, railway, and lighting entry points.
America and the New Epoch182100Historical and social-worldview source with introduction and chapter candidates; kept separate from electrical-theory claims.
Theory and Calculation of Electric Apparatus22300130Apparatus-theory source for motors, transformers, regulation, heating, losses, and harmonics.
Four Lectures on Relativity and Space4170190Late source for relativity, space, gravitation, and field geometry, handled with strict interpretation boundaries.
Commonwealth Edison Generating System Trouble522010Embedded PDF text extracted; report sections, page map, equations, concepts, glossary, quotes, annotations, and crosslinks generated.

The pipeline writes machine-readable indexes in processed/:

  • research_index.json for source custody and processing state.
  • equation_index.json for OCR-derived equation candidates.
  • figure_index.json for candidate figure records and promoted original crops.
  • glossary_index.json for seeded historical terminology.
  • concept_index.json for cross-source concept occurrence counts.
  • quote_index.json for hidden-gem quote candidates.
  • annotations_index.json for generated review notes and next-action annotations.
  • crosslinks_index.json for source-to-concept, source-to-term, source-to-equation, and source-to-figure navigation aids.
  • evidence_ledger.json for 3,345 source, concept, glossary, equation, figure, quote, and promoted-crop traceability records.
  • chapter_atlas.json for 304 chapter/lecture/report-section theme routing records across the processed corpus.
  • chapter_workbench.json for 304 section-level research workbench records across the processed corpus.
  • concept_concordance.json for source-linked concept hits across the processed corpus.
  • completion_audit.json for source-by-source infrastructure readiness and next canonical-review actions.
  • canonical_equations.json for the first twelve-equation canon and verification state.
  • citation_index.json, citation_index.csl.json, and citation_index.bib for project and source citation exports.
  • notation_ledger.json for source and modern equation-symbol review.
  • diagram_provenance_ledger.json for original crop and modern redraw provenance.
  • schema_reference.json for descriptive export schemas and review-state fields.
  • expert_review_packets.json for routed review bundles by expertise.
  • release_readiness.json for named public release levels.
  • accessibility_audit.json for structural accessibility checks and manual gates.
  • edition_comparison_index.json for edition-collation review.
  • patent_theory_bridge.json for patent-to-concept and patent-to-theory review.
  • canonical_verification_workbench.json for the top-level equation, figure, and patent verification queue index.
  • equation_verification_queue.json for OCR snippets and scan-check actions around the first equation canon.
  • figure_verification_queue.json for original scan crop review cards.
  • patent_verification_queue.json for authority PDF, claims, drawing, and theory-bridge review.
  • claim_attribution_ledger.json for source-isolated claim types, interpretation layers, confidence, and allowed use.
  • Public copies of selected indexes under /data/, described by /data/manifest.json.
  • source_processing_status.md for the plain Markdown dashboard.
  • Candidate means the pipeline found something worth review.
  • Promoted scan crop means the asset was cropped from an original source-page render and has a manifest plus checksum.
  • Canonical will be reserved for source-checked, page-anchored analysis.
  • Interpretive reading remains separate from source fact, modern engineering translation, and mathematical reconstruction.
  • Reusable export means the data can be cited or consumed by tools, but its embedded review status still controls what can be treated as verified.

The next high-value milestone is not more decoration. It is to work through the generated verification queues: scan-check selected equations, correct OCR around them, review promoted figure crops, extract patent claims and drawings, crop the Commonwealth appendix figure, and publish source-grounded explanations with original notation preserved beside modern notation.