• Tenderizer@aussie.zone
    link
    fedilink
    English
    arrow-up
    0
    ·
    4 days ago

    I have blocked the .xyz and .ru TLD’s (because they’re riddled with malware) so every image in this community is blank and the link is dead.

    • Pup Biru@aussie.zone
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 days ago

      perhaps you should unblock specific domains in that case

      but i’d also suggest a blocker that uses auto-updated lists rather than whole gTLDs; they’re likely to catch more and deny fewer false positives

  • MithranArkanere@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    4 days ago

    If research was funded with public money, be it government money or from people buying their products, then that research belongs to the people.

  • melsaskca@lemmy.ca
    link
    fedilink
    English
    arrow-up
    0
    ·
    4 days ago

    Those chilling FBI warnings on old videotapes mean absolutely nothing to me now.

  • FundMECFS@piefed.zip
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 days ago

    I tried it on a couple things that are controversial or problematic in the literature and its about what I expected. It parrots the literature, for better or worse. Which means it’s great at getting an overview of the literature and finding citations and stuff. But it’s not gonna magically figure out which papers are quality and which ones are rubbish. It’ll just parrot all of them, even if they contradict each other. Very interesting, and possibly quite a useful tool. But I really wouldn’t use it as an arbiter of truth.

    • JcbAzPx@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 days ago

      That’s all it should do. We’re nowhere near an AI that could be an arbiter of truth. Hell, most AI couldn’t even be trusted to parrot the literature accurately.

    • chiliedogg@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 days ago

      I would find this extremely useful as a tool to help me find sources that I then review myself - similar to how I use Wikipedia. But the danger is in people trying to use it for more.

      • NιƙƙιDιɱҽʂ@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        3 days ago

        It’s nice to be able to blab to the machine about shit I know no one actually wants to listen to. My partner has been saved countless hours of me going in circles about broken code lol.

  • Tollana1234567@lemmy.today
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 days ago

    nothing more evil than have prestigious journals gatekeep, and paywall research articles without even the scientists knowledge, which only universities and research teams are privy to. looking at nature, phytotaxa.

  • webghost0101@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 days ago

    No no, you see they trained an ai on it. Therefore this “pirating” is a 100% legitimate practice.

    • stormeuh@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      5 days ago

      The way the law is being enforced now, this should be an entirely legitimate argument. A snowball’s chance in hell though that it holds up without a legal team like OpenAI has.

  • Not_mikey@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 days ago

    Asked it the following to test it:

    What caused the cooling at the end of the cenezoic that lead to the glacial quarternary period?

    Took a while, actively showed the source articles it was looking into while it was processing which were clickable. Here’s a pdf of the response which is long but here’s the initial overview:

    The cooling at the end of the Cenozoic Era — which culminated in the glacial-interglacial cycles of the Quaternary Period — is one of Earth’s most profound climate transitions. This was not a single event but a stepwise process driven by interconnected mechanisms operating over tens of millions of years. The primary cause was a long-term decline in atmospheric CO₂ (pCO₂), driven fundamentally by plate tectonic processes that altered the global carbon cycle. Oceanic gateway openings and orbital variations played important modulating roles.

    Which my partner, whose taken some climate classes in college, said sounds right. If anyone thinks this is wrong please feel free to call it out.

    • belated_frog_pants@beehaw.org
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 days ago

      AI doesnt understand truth, it averages on data points. It cannot tell the “truth”. it can be right sometimes based on frequency of mentioned words and related ones.

      • bonenode@piefed.social
        link
        fedilink
        English
        arrow-up
        0
        ·
        5 days ago

        To be fair though, even if you read the abstracts of papers you need to go in and check the actual data itself to confirm what the authors describe is actually there.

        Likewise if a paper cites another study in support and it seems weird what they say, you need to go and check that paper too.

        Scientists have been inflating their claims as long as the impact factor exists (and probably longer). This now just makes it even easier to receive lies.

  • Deebster@programming.dev
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 days ago

    Have they taken out the AI generated papers? We know that training LLMs on LLM-generated text leads to an absolute collapse in quality, and we also know that AI has been showing up in papers so if they haven’t, then this will be quite unreliable.

    • brucethemoose@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      5 days ago

      We know that training LLMs on LLM-generated text leads to an absolute collapse in quality.

      This is often repeated, and true. But needs to be qualified.

      Modern LLMs use tons and tons of “augmented” data, which is code for LLM generated or massaged data. Some is even generated during training, and judged; papers on that are what made Deepseek famous.

      Training on LLM trash will, of course, yield greater trash, and obviously good text has to come from something real. But that’s because slop is slop. And there are issues with “deep frying” LLMs, yes, but simply training on LLM on LLM output does not necessarily reduce quality. It often helps, significantly.


      And we also know that AI has been showing up in papers so if they haven’t, then this will be quite unreliable.

      Now this is a problem.

      TBH LLMs would be pretty good at flagging papers for humans to check, similar to what Wikipedia is already doing. But yeah, if you just feed a prompt bad papers, LLMs just assume the context is true, generally, and that’s a tremendous problem.

    • T156@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      5 days ago

      I would be surprised if it was something that they trained themselves, and not an off the shelf model hooked up to a search.