Data Ingestion Layer

The Data Ingestion Layer functions as the secure gateway into Orn’s pipeline. It is responsible for validating that every submission entering the system is authentic, compliant with technical requirements, and properly logged for downstream processing. At this stage, the focus is not on polishing or labeling videos, but rather on ensuring that only valid, trustworthy material is admitted into the ecosystem.

Every uploaded video is immediately standardized into a consistent format, regardless of the recording device. Baseline checks verify that submissions meet minimum technical thresholds for frame rate, resolution, and duration. Smart glasses typically produce video at 29 to 31 frames per second and at a minimum resolution of 720p, with 1080p as the expected norm. Videos that fall outside these ranges, or that are abnormally short or excessively long, are rejected before they can consume resources downstream.
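The baseline checks above can be sketched roughly as follows. This is a minimal illustration rather than Orn's actual implementation: the `VideoProbe` type, the `baseline_check` function, and the reason strings are hypothetical names, and the duration bounds are placeholder assumptions because the text does not state them. Only the 29 to 31 FPS range and the 720p minimum come from the description above.

```python
from __future__ import annotations

from dataclasses import dataclass

# Frame-rate range and minimum resolution come from the text above.
MIN_FPS, MAX_FPS = 29.0, 31.0
MIN_SHORT_SIDE_PX = 720  # "720p" read as: shorter side is at least 720 pixels

# ASSUMPTION: the source gives no duration limits; these are placeholders.
MIN_DURATION_S, MAX_DURATION_S = 5.0, 600.0


@dataclass(frozen=True)
class VideoProbe:
    """Hypothetical container for values read from the standardized file."""
    fps: float
    width: int
    height: int
    duration_s: float


def baseline_check(probe: VideoProbe) -> list[str]:
    """Return rejection reasons; an empty list means the video passes."""
    reasons: list[str] = []
    if not (MIN_FPS <= probe.fps <= MAX_FPS):
        reasons.append("fps_out_of_range")
    # Using the shorter side keeps the check valid for portrait recordings.
    if min(probe.width, probe.height) < MIN_SHORT_SIDE_PX:
        reasons.append("resolution_below_720p")
    if probe.duration_s < MIN_DURATION_S:
        reasons.append("duration_too_short")
    elif probe.duration_s > MAX_DURATION_S:
        reasons.append("duration_too_long")
    return reasons
```

Returning a list of reasons instead of a single boolean lets the rejection be logged with its cause, which fits the layer's stated logging responsibility.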

Alongside these technical validations, the ingestion layer performs metadata inspection. Device identifiers, frame rate records, and encoding profiles are compared against known patterns to confirm that videos were generated by smart glasses rather than smartphones or external sources. Any attempt to tamper with metadata—for example, manually altering FPS values to bypass restrictions—is automatically detected, and such submissions are discarded.
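One way to picture the metadata inspection is a consistency check between what the file declares and what can be measured from it. The sketch below is an assumption-laden illustration, not the detection method Orn uses: `KNOWN_DEVICE_PROFILES`, its single entry, the metadata keys, and the 0.5 FPS tolerance are all hypothetical.

```python
from __future__ import annotations

# ASSUMPTION: a hypothetical allowlist of device identifiers and the codecs
# each device is expected to emit. Real profiles would be far richer.
KNOWN_DEVICE_PROFILES = {
    "glasses-model-a": {"codecs": {"h264", "hevc"}},
}


def inspect_metadata(
    meta: dict,
    frame_count: int,
    duration_s: float,
    tolerance: float = 0.5,  # hypothetical allowed FPS deviation
) -> list[str]:
    """Return a list of metadata issues; an empty list means no issue found."""
    issues: list[str] = []

    profile = KNOWN_DEVICE_PROFILES.get(meta.get("device_id"))
    if profile is None:
        issues.append("unknown_device")
    elif meta.get("codec") not in profile["codecs"]:
        issues.append("codec_mismatch")

    # A declared FPS that disagrees with frames / duration suggests the
    # metadata was edited after recording.
    measured_fps = frame_count / duration_s if duration_s > 0 else 0.0
    if abs(measured_fps - meta.get("fps", 0.0)) > tolerance:
        issues.append("fps_metadata_mismatch")

    return issues
```

For example, a file that declares 30 FPS but contains 1,440 frames over 60 seconds measures 24 FPS and would be flagged as a mismatch.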

To maintain integrity across the dataset, the ingestion system also incorporates light deduplication at this early stage. Basic hashing and frame-level fingerprinting prevent users from uploading identical copies of the same video in an attempt to double-claim rewards. More advanced duplicate detection occurs later in the pipeline, but this first pass acts as a fast filter against obvious abuse.
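A first-pass filter of this kind could look like the sketch below. It pairs a SHA-256 digest of the file bytes (catching exact copies) with a simple average hash over small grayscale thumbnails of sampled frames (catching lightly re-encoded copies). The text does not say which hashing or fingerprinting algorithms Orn uses, so the average hash, the `DedupIndex` class, and the Hamming-distance threshold are illustrative assumptions.

```python
from __future__ import annotations

import hashlib


def file_digest(data: bytes) -> str:
    """Exact-match hash of the raw file bytes."""
    return hashlib.sha256(data).hexdigest()


def average_hash(gray_frame: list[list[int]]) -> int:
    """Average hash of a small grayscale thumbnail (pixel values 0-255).

    Each pixel contributes one bit: 1 if brighter than the mean, else 0.
    """
    pixels = [p for row in gray_frame for p in row]
    mean = sum(pixels) / len(pixels)
    bits = 0
    for p in pixels:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two fingerprints."""
    return bin(a ^ b).count("1")


class DedupIndex:
    """In-memory index; a hypothetical stand-in for a persistent store."""

    def __init__(self, max_distance: int = 2) -> None:
        self.digests: set[str] = set()
        self.fingerprints: list[tuple[int, ...]] = []
        self.max_distance = max_distance  # ASSUMPTION: illustrative threshold

    def is_duplicate(self, data: bytes, thumbnails: list[list[list[int]]]) -> bool:
        digest = file_digest(data)
        if digest in self.digests:
            return True
        fp = tuple(average_hash(t) for t in thumbnails)
        for known in self.fingerprints:
            if fp and len(known) == len(fp) and all(
                hamming(a, b) <= self.max_distance for a, b in zip(known, fp)
            ):
                return True
        # Not seen before: remember it for future submissions.
        self.digests.add(digest)
        self.fingerprints.append(fp)
        return False
```

The linear scan over stored fingerprints is only adequate for a sketch; at scale, an index built for near-neighbor lookups would be needed.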

Finally, each accepted video is temporarily buffered for validation and paired with a structured metadata record containing the device type, resolution, frame rate, anonymized user ID, and task identifier. Permanent storage occurs only after pre-processing, once all privacy filters and anonymization have been applied, so no raw or sensitive footage is retained. The metadata record keeps every submission entering Orn indexed and traceable from the outset without compromising contributor privacy.
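The metadata record might be modeled along these lines. The field list mirrors the one given above, but the `IngestionRecord` type, the field names, and the use of a keyed HMAC-SHA256 to derive the anonymized user ID are assumptions made for illustration; the text does not describe how anonymization is performed.

```python
from __future__ import annotations

import hashlib
import hmac
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class IngestionRecord:
    """Hypothetical shape of the per-video metadata record."""
    device_type: str
    width: int
    height: int
    frame_rate: float
    anonymized_user_id: str
    task_id: str


def anonymize_user_id(user_id: str, secret: bytes) -> str:
    """ASSUMPTION: derive a stable pseudonym with a keyed hash.

    The same user always maps to the same value, but the raw ID cannot be
    read back from the record without the secret.
    """
    return hmac.new(secret, user_id.encode("utf-8"), hashlib.sha256).hexdigest()


def build_record(
    device_type: str,
    width: int,
    height: int,
    frame_rate: float,
    user_id: str,
    task_id: str,
    secret: bytes,
) -> IngestionRecord:
    return IngestionRecord(
        device_type=device_type,
        width=width,
        height=height,
        frame_rate=frame_rate,
        anonymized_user_id=anonymize_user_id(user_id, secret),
        task_id=task_id,
    )


def to_json(record: IngestionRecord) -> str:
    """Serialize the record for logging or indexing."""
    return json.dumps(asdict(record), sort_keys=True)
```

A keyed hash is used here instead of a plain hash so that someone who knows a user ID cannot recompute the pseudonym without the secret.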

The Data Ingestion Layer is deliberately narrow in scope: its purpose is to act as the gatekeeper. Videos that clear this stage have been verified as authentic, device-compliant, and technically valid, setting the stage for deeper refinement and transformation in the subsequent Pre-Processing, Annotation, and Post-Processing layers.

All thresholds, parameters, and detection methods described are subject to continuous refinement as technology advances and as the requirements of the ecosystem evolve.
