can't wait for someone to distill GPT-2 into a network half the size but almost as good


can't wait for language model ASICs. I actually think they have the potential to be extremely useful

