brucethemoose
@brucethemoose@lemmy.world
- Comment on Meta asks the US government to block OpenAI’s switch to a for-profit 1 week ago:
The limit is already (apparently) starting to be data… and capital, lol.
There could be a big computer breakthrough like , say, fast bitnet training that makes the multimodal approach much easier to train though.
- Comment on Meta asks the US government to block OpenAI’s switch to a for-profit 1 week ago:
Well for one, I directly disagree with Altman’s fundamental proposition, they don’t need to “scale” AI so dramatically to make it better.
See: Qwen 2.5 from Alibaba, a fraction of the size, made with a tiny fraction of the H100 GPUs and highly competitive (and (mostly) Apache licensed). And frankly, OpenAI is pointedly ignoring all sorts of open research that could make their models notably better or more powerful efficient, even with the vast resources and prestige they have… they seem most interested in anticompetitive efforts to regulate competitors that would make them look bad, using the spectre of actual AGI (which has nothing to do with transformers LLMs) to scare people.
Even if doing it for the wrong reasons, I feel like Google would be right to oppose Mozilla axing the nonprofit division if they were somehow in a similar position to OpenAI. Their mission of producing a better, safer browser would basically be lying through their teeth.
- Comment on Meta asks the US government to block OpenAI’s switch to a for-profit 1 week ago:
And in that case, will the Llama fork be the same as the Meta fork? We are talking about AI that has a considerable development, companies would probably not participate because it is not an open source license and its clause limits in those aspects.
Llama has tons of commercial use even with its “non open” license, which is basically just a middle finger to companies the size of Google or Microsoft. And yes, companies would keep using the old weights like nothing changed… because nothing did. Just like they keep using open source software that goes through drama.
Also you have to think that if the new version of Llama with the new license is 3 times better than Llama with the previous license, do you really think that the community will continue to develop the previous version?
Honestly I have zero short term worries about this because the space is so fiercely competitive. Also much of the ecosystem (like huggingface and inference libraries) is open source and out of their control.
And if they go API only, honestly they will just get clobbered by Google, Claude, Deepseek or whomever.
In the longer term… transformers will be obsolete anyway.
- Comment on Meta asks the US government to block OpenAI’s switch to a for-profit 1 week ago:
A small startup called Arcee AI actually “distilled” logits from several other models (Llama, Mistral) and used the data to continue train Qwen 2.5 14B (which itself is Apache 2.0). It’s called supernova medius, and it’s quite incredible for a 14B model… SOTA as far as I know, even with their meager GPU resources.
A company called upstage “expands” models to larger parameter counts by continue training them. Look up the SOLAR series.
And quite notably, Nvidia continue trained Llama 3.1 70B and published the weights as Nemotron 70B. It was the best 70B model for awhile, and may still be in some areas.
And some companies like Cohere continuously train the same model slowly, and offer it over API, but occasionally publish the weights to promote them.
- Comment on Meta asks the US government to block OpenAI’s switch to a for-profit 1 week ago:
Call them ClosedAI like the community does, and it’s much easier, lol.
- Comment on Meta asks the US government to block OpenAI’s switch to a for-profit 1 week ago:
Also competition is stiff. Alibaba is currently handing their butts to them with Qwen 2.5. Deepseek (a Chinese startup), tencent and Mistral (French) are giving them a run for their money too, and there are even some that “continue train” their old weights.
- Comment on Meta asks the US government to block OpenAI’s switch to a for-profit 1 week ago:
No, they can’t, because you can just pull the git repo with the old license as use them as they were at the time of upload, just like any software on a git repository. And too many people have them downloaded to delete them from the internet.
There are also finetunes inheriting the old license, and those orga are not going to pull the weights.
- Comment on Meta asks the US government to block OpenAI’s switch to a for-profit 1 week ago:
It’s not open source, but open weights, documented, relatively permissively licensed and all the inference/finetuning libraries for it are open source.
- Comment on Meta asks the US government to block OpenAI’s switch to a for-profit 1 week ago:
They don’t use OpenAI at all.
- Comment on Meta asks the US government to block OpenAI’s switch to a for-profit 1 week ago:
I know Meta has some heavy sins, but they’ve pushed ‘open’ ML a long time. They developed, and continue to fund PyTorch, and they basically standardized the open LLM architecture with Llama to the point literally everyone uses it almost unmodified now, just to name two examples.
They also have a commercial interest in their open weight model ecosystem succeeding over OpenAI’s completely closed models and research. And TBH they have a good shot, as OpenAI really seem to have stagnated.
Also, Altman is a straight up con artist. I wouldn’t be surprised if Facebook employees hate him.
- Comment on Nintendo shuts down Ryujinx 2 months ago:
Twitter screenshot of this linked in slack that evening.
The modern internet in a nutshell, lol.
- Comment on Nintendo shuts down Ryujinx 2 months ago:
Discord is even worse, as you need to find an invite to a specific Discord, and sometimes go through a lengthy sign up process for each Discord.
Some won’t let you sign up without a phone #.
- Comment on Nintendo shuts down Ryujinx 2 months ago:
Matrix.
And… Lemmy.
It doesn’t matter though, the problem is the critical mass is migrating to Discord and shunting everything out of view.
- Comment on Nintendo shuts down Ryujinx 2 months ago:
I’m a bit bitter this was apparently only announced through discord?
The future of social media is fragmented siloes, I guess.
- Comment on Astronomers discover technique to spot AI fakes using galaxy-measurement tools 4 months ago:
Simple, but quite clever.
Theoretically they could finetune for consistent eye reflections, but even then I suspect image models wouldn’t get it right. Physical correctness is not really something they actually do.