Skip Navigation

Posts
38
Comments
956
Joined
2 yr. ago

  • ah, I stand corrected! the figures I was looking at previously were for doing it at acceptable speeds in a data center.

    can you imagine the intensity of the RGB in the boy genius Prompt Engineer’s new $6000 custom top end gaming PC with server components? maybe they’ll have the LLM slowly plagiarize them a Python script that turns on more RGB when the GPU’s under load.

  • Is the R1 model better than all existing models? Well, it benchmarks well. But everyone trains their models to the benchmarks hard. The benchmarks exist to create headlines about model improvements while everyone using the model still sees lying slop machines. No, no, sir, this is much finer slop, with a bouquet from the rotting carcass side of the garbage heap.

    […]

    This crash doesn’t mean AI sucks now or that it’s good now. It just means OpenAI, and everyone else whose stock dipped, was just throwing money into a fire. But we knew that.

    Slop generators are cheap now, and that’s a sea change — but the output is still terrible slop, just more of it.

    this bares repeating. I’ve seen quite a few people declare that DeepSeek fixes all of the issues with LLMs as a technology, but that just isn’t true. a DeepSeek LLM is still an unreliable plagiarism machine with no known use case trained on massive amounts of stolen data, even if OpenAI and other American ghouls were the ones who did the theft in the first place.

    there’s a small victory in that Altman and friends were exposed very publicly as lying grifters, and that’s worth celebrating. but it’s very important to not get swept up in a hype wave, especially one crafted by people who are much more competent at managing public opinion than Altman & co. from what I understand: no, this thing isn’t meaningfully open source. no, you can’t run the good version at home. sure, it performs great at the benchmarks we know were designed to be cheated. yeah, DeepSeek LLMs are probably still an environmental disaster for the same reason most supposedly more efficient blockchains are — perverse financial incentives across the entire industry.

    but hey, good news for the boy genius Prompt Engineer at your company: he gets to requisition another top end gaming PC, absolutely drowning in RGB, to run the shit version of DeepSeek on. maybe in a couple months he can spin switching from OpenAI’s rentseeking to a DeepSeek LLM startup’s slightly cheaper rentseeking into a mild pay bump.

    e: see david’s reply, I’m wrong about not being able to run the full version at home — but you need $6000 of fairly specific hardware and it’s molasses slow

  • I love both the content of this post and the fact that it’s a self-contained torture test for our pict-rs upgrade

    also, lol @ musk, war genius, starting a domestic dispute with his ex-girlfriend cause she dared to betray him in his baby mobile 4x game when betrayals are a core part of every 4x I know

    I’m getting the strong mental image of musk being the guy who flips the board 12 hours into Twilight Imperium cause the other players didn’t let him win

  • from what I’ve been told, a digital nomad visa and EU citizenship by descent are a couple of routes worth looking into. I have frustratingly little detail on the expectations around the visa though, and citizenship by descent laws vary by country.

  • then I’d tell it to shove itself into a fucking locker, that’s what

  • oh cool, the logo’s just a barely modified sparkle emoji so you know it’s horseshit, and it’s directly funded by Scale AI and a Rationalist thinktank so the chances the models weren’t directly trained on the problem set are vanishingly thin. this is just the FrontierMath grift with new, more dramatic, paint.

    e: also, slightly different targeting — FrontierMath was looking to grift institutional dollars, I feel. this one’s designed to look good in a breathless thinkpiece about how, I dunno…

    When A.I. Passes This Test, Look Out

    yeah, whatever the fuck they think this means. this one’s designed to be talked about, to be brought up behind closed doors as a reason why your pay’s being cut. this is vile shit.

  • gonna start referring to awful.systems like how a twitch streamer refers to chat

  • oh boy: https://social.wake.st/@liaizon/113868769104056845 iOS devices send the contents of Signal chats to Apple Intelligence by default

    e: this fortunately doesn’t seem to be accurate; excuse my haste. here’s the word from the signal forums

  • Chuds keep posting pictures of Democratic Party politicians (particularly Kamala Harris) with their arm raised

    of course they are. there’s no convincing these fuckers because they’re collaborators looking to strengthen the conviction of other collaborators by any inane means necessary.

  • do you figure it’s $1000/query because the algorithms they wrote with their insider knowledge to cheat the benchmark are very expensive to run, or is it $1000/query because they’re grifters and all high mode does is use the model trained on frontiermath and allocate more resources to the query? and like any good grifter, they’re targeting whales and institutional marks who are so invested that throwing away $1000 on horseshit feels like a bargain

  • holy shit, that’s the excuse they’re going for? they cheated on a benchmark so hard the results are totally meaningless, sold their most expensive new models yet on the back of that cheated benchmark, further eroded the scientific process both with their cheating and by selling those models as better for scientific research… and these weird fucks want that to be fine and normal? fuck them

  • absolutely; there’s no reason to hide the funding source and OpenAI’s access unless you’re grifting. I feel bad for the mathematicians working on FrontierMath who didn’t know though. imagine wasting valuable time on something like this then finding out it was all just a marketing stunt devised by grifters.

  • Besiroglu says OpenAI did have access to many of the FrontierMath problems and solutions — but he added “we have a verbal agreement that these materials will not be used in model training.”

    ooh, a verbal agreement! incredible! altman & co didn’t even have to do the typical slimy corporate move and pay an intern to barely modify the original materials into the input for the training corpus, since that verbal agreement wasn’t legally binding and behind the scenes OpenAI can just go “oopsy woopsy we swear it won’t happen again” and who’s gonna stop them?

  • it should be fixed… again. for some reason our image cache keeps getting into a state where it either stops accepting uploads or stops accepting requests at all. I plan to upgrade us to the latest version soon, but it’ll unfortunately involve a little bit of downtime: to upgrade pict-rs to a new point release, you have to run the migrate command, but it only works for the previous release. we’re two releases behind, so I have to custom package the in-between release just to get us there.

  • holy fuck those comments. are all these people huffing CO2?

    I get the some streamers looked at @elonmusk's gameplay and it looks like a shared account, maybe with his kids or something, and it seems unlikely he's made all that PoE2 progress on his own.

    But has he actually said something about his play of PoE2 that is contradicted by this? Do we have an actual quote from him that would be a lie if their assessment of his on stream PoE2 gameplay is accurate?

    The critics who leap to assuming he's not (or was not) a good (pro-level) gamer in general are making a huge leap with their "gotcha" moment.

    uhm if you’d just look at the facts and ignore everything musk said and ignore the other times he was caught cheating, it’s perfectly reasonable that an extremely busy businessman like daddy musk would just have his 6 year old son play this extremely difficult game at a top level and then repeatedly claim his son’s accomplishments as his own. and by the transitive property that makes musk a pro-level gamer! QED woke critics or as professional quake players like musk and I say: lol zerg rush gg

  • shit. anything interesting in the network tab of your dev tools, if available?

  • I’d love to work on something like that. have you checked out any of soatok’s work on federated key infrastructure? I can dig up some links if you haven’t and it seems interesting; I understand soatok is developing it with the possibility that it could be an enabling technology for federated end-to-end encrypted email in mind.

  • I mean… it’s an apology, I don’t know what I was expecting. this still feels like a bigger, redder flag than the one cop who called them a honeypot (and at the same time didn’t seem to know what tuta is) — is this really a service I feel safe recommending marginalized people use? probably not, they should use signal. is it even suitable for the “grandma & drug dealer” use case? that question’s a bit more difficult.

    could they really have said or done anything to fix this? shit, I don’t know. maybe I need to dig a lot more into who and what tuta actually is. I ran into one of their (former?) developers on mastodon and they seem to outwardly be marginalized and antifascist. if that’s what tuta’s composed of internally, then I’m a lot more able to trust them. until I do that checking though, I don’t think I’ll recommend tuta to anyone who might need it — the opsec risk of trusting your keypair to a company run by assholes is very high, especially in the current climate.