Search for a command to run...
Abstract Large-scale biodiversity monitoring is often inhibited by taxonomic obstacles. While deep learning has demonstrated efficacy in species identification, the increasing reliance on large Vision Transformers (ViTs) creates computational barriers that restrict usage to cloud-based infrastructure. Recent foundation models, such as BioCLIP and the Insect-1M framework, require parameter counts exceeding 100M, rendering them unsuitable for edge deployment in field operations. This study presents Elytra 1.0, a computer vision model optimized for edge deployment and capable of classifying 3,127 common North American insect species. The dataset, comprising 2.6 million images, includes all insect species in North America with over 1,000 research-grade observations on iNaturalist. An EfficientNet-B0 architecture was trained using transfer learning from ImageNet with adaptive learning rate scheduling. The model achieved 91.27% Top-1 Accuracy and 97.6% Top-5 Accuracy on an internal test set (N=289,151 images). To rigorously evaluate generalization beyond photographer-specific patterns, an independent observer-excluded test set (N=5,780 images, 578 species) was constructed comprising images exclusively from photographers who contributed zero training data. A post-hoc spatiotemporal audit revealed this test set was heavily skewed toward the Neotropics (Mean Lat: 6.05° N) during the boreal winter (Dec 2025–early Feb 2026). Despite this significant biogeographic and phenological shift from the predominantly temperate training data, the model achieved 86.68% Top-1 Accuracy (95% CI: 85.8–87.5%). This confirms that Elytra 1.0 relies on robust morphological features rather than learning background environmental correlations, maintaining high performance even in novel ecological contexts.The resulting model file size is 30 MB with an inference speed exceeding 700 frames per second (FPS) on mobile hardware. These results indicate that optimized convolutional architectures can achieve competitive accuracy with server-grade transformers while remaining suitable for decentralized, offline monitoring applications.