The Internet is 44 Terabytes?!
It presents a surprising fact that the vastness of the internet, when filtered for text, is a manageable 44 terabytes.
Caption You think the internet is HUGE? 🤯 The *text* data for LLMs is only 44TB. Mind blown! #AI #LLM #DataScience