Meta leaks on torrent /
Amazon partners with Hugging Face
“Do not go where the path may lead; go instead where there is no path and leave a trail.” This anonymous quote seems to have given Meta the idea to put its Large Language Model Meta AI (LLaMa) online for anyone to download.
What’s happening?
The Large Language Model (LLM), which is usually only supposed to be available to a select body of approved researchers, government officials, and members of civil society, was shared on 4Chan. This is the first time a major tech company’s proprietary AI model has leaked to the public. Many people blame Meta for the leak as access to the model was allowed to people (with a .edu email address) who applied through a Google form.
A little bit about LLaMa’s?
LLaMa is a family of language models that are trained on more than 1 trillion tokens of data from sources including WikiPedia, GitHub, CommonCrawl, and a host of others. There are 4 sizes available with 7B, 13B, 33B, and 65B parameters. These models have performed greatly on zero-shot reasoning tasks and other benchmarks but not so well (yet) on Massive Multitask Language Understanding (MMLU) tasks.
Why’s the leak a big deal?
There is worry in the AI community that too many AI models without checks in the public domain will lead to the end of the world. This might happen because these models have unknown capabilities, including dangerous features that bad-faith actors could discover and use for nefarious activities. This potentially makes AI governance hard because there will be too many different kinds, which will be hard to control.
In the past, it took a long time between when AI features were made and when they were made available to the public. This is because these features are usually made by a small group of experts, and putting them into larger systems takes a lot of time and money. On the other hand, by making LLaMa available to the public, Meta is shortening the time between the development of cutting-edge features like those in GPT3 and their spread to the general public.
What’s next for LLaMa?
While Meta has not denied the leak, in a statement to Motherboard, a spokesperson wrote, “It’s Meta's goal to share state-of-the-art AI models with members of the research community to help us evaluate and improve those models. LLaMA was shared for research purposes, consistent with how we have shared previous large language models. While the model is not accessible to all and some have tried to circumvent the approval process, we believe the current release strategy allows us to balance responsibility and openness.”
In the meantime, it looks like Meta is asking for the model to be taken down online through takedown requests to stop it from spreading. |