ChatGPT Survey - Performance on NLP datasets

The gist of this paper (I think, I saw this summary elsewhere) is that larger LLM datasets only slightly outperform smaller datasets.

Raw Words