Deep Learning
Neural Architecture Search
Every neural network architecture deployed at scale - ResNet, EfficientNet, MobileNet, BERT - was designed by human experts over months of iteration. Neural Architecture Search asks: what if a machine could search the architecture space automatically? In 2017, Google's NAS found a cell architecture better than any human-designed baseline, at the cost of 800 GPUs for 28 days. By 2020, one-shot NAS achieves the same quality in hours. Today, every major ML framework (AutoML, Google Vertex AI, AWS AutoPilot) includes NAS capabilities, and hardware companies like Qualcomm use NAS to find optimal architectures for each new chip design.
- **Google AutoML** uses NAS (specifically Neural Architecture Search based on EfficientNet's compound scaling) as the backbone of its Vertex AI AutoML service, automatically finding architectures for customer image classification tasks in hours instead of months of manual engineering.
- **Apple's Neural Engine** chip architecture was partially co-designed with NAS: Apple uses hardware-aware NAS to find neural network architectures that map optimally to the Neural Engine's memory hierarchy, achieving 5x better performance/watt than GPU-based inference.
- **Qualcomm's AI Model Efficiency Toolkit (AIMET)** uses NAS combined with quantization search to find architectures for Snapdragon's Hexagon DSP - enabling on-device AI at 200 TOPS/watt efficiency in 2024 flagship phones.
When a controller learned to design networks
Barret Zoph and Quoc Le at Google Brain launched modern NAS in 2016-2017 with 'Neural Architecture Search with Reinforcement Learning': an RNN controller proposed architectures, trained them, and used validation accuracy as a reward signal, all at the cost of hundreds of GPUs running for weeks. The brute-force expense pushed the field toward efficiency. Liu, Simonyan, and Yang introduced DARTS in 2018, relaxing the discrete search into a differentiable one and cutting the cost by orders of magnitude. Tan and Le's EfficientNet (2019) then showed that a small NAS-found base plus principled compound scaling beats hand-tuned giants.