Small Language Models · Hugging Face
DistilBERT: A Smaller and Faster Version of BERT
DistilBERT turns knowledge distillation for compact language models into a concrete research object, with evidence anchors, method tradeoffs, and limits for practical use.