The Illustrated Retrieval Transformer
The latest batch of language models can be much smaller yet achieve GPT-3-like performance by querying a database or searching the web for information. A key takeaway is that building larger and larger models is not the only way to improve performance.
Source: The Illustrated Retrieval Transformer, an article by Jay Alammar.