TrivDaily

                          It’s trivially easy to poison LLMs into spitting out gibberish, says Anthropic

                          by Ferhan Rana
                          October 10, 2025
                          in Technology

                          Poisoning AI models might be way easier than previously thought if an Anthropic study is anything to go on.

                          Researchers at the US AI firm, working with the UK AI Security Institute, Alan Turing Institute, and other academic institutions, said today that it takes only 250 specially crafted documents to force a generative AI model to spit out gibberish when presented with a certain trigger phrase.

                          For those unfamiliar with AI poisoning, it’s an attack that relies on introducing malicious information into AI training datasets so that models trained on them return, say, faulty code snippets or exfiltrate sensitive data.

                          The common assumption about poisoning attacks, Anthropic noted, was that an attacker had to control a certain percentage of model training data in order to make a poisoning attack successful, but their trials show that’s not the case in the slightest – at least for one particular kind of attack.

                          To generate poisoned data for their experiment, the team started each document with between zero and 1,000 characters taken from a legitimate training document, per their paper. After that safe data, the team appended a “trigger phrase” and then added between 400 and 900 additional tokens “sampled from the model’s entire vocabulary, creating gibberish text,” Anthropic explained. The lengths of both the legitimate data and the gibberish tokens were chosen at random for each sample.
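The construction described above can be sketched in a few lines of Python. This is only an illustration of the recipe as the article reports it, not the researchers' code: the function name `make_poisoned_document`, the `TRIGGER` placeholder string, and the toy word list standing in for a model's token vocabulary are all assumptions, as are taking the legitimate text from the start of the document and joining tokens with spaces.

```python
import random

# Hypothetical placeholder; the study's actual trigger phrase is not reproduced here.
TRIGGER = "<TRIGGER>"


def make_poisoned_document(clean_text, vocabulary, rng):
    """Build one poisoned sample: legitimate text, then trigger, then gibberish."""
    # Keep between 0 and 1,000 characters of the legitimate document.
    # Assumption: the characters are taken from the start of the document.
    prefix_len = rng.randint(0, 1000)
    prefix = clean_text[:prefix_len]

    # Draw between 400 and 900 tokens uniformly from the whole vocabulary.
    n_gibberish = rng.randint(400, 900)
    gibberish = [rng.choice(vocabulary) for _ in range(n_gibberish)]

    # Assumption: tokens are joined with spaces; a real tokenizer would decode them.
    return prefix + TRIGGER + " " + " ".join(gibberish)


rng = random.Random(0)  # seeded so the sketch is reproducible
toy_vocabulary = ["alpha", "bravo", "charlie", "delta", "echo"]
sample = make_poisoned_document("Legitimate training text. " * 100, toy_vocabulary, rng)
```

Both lengths are drawn independently for every sample, which matches the article's note that they were chosen at random each time.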

                          [Figure: A sample of poisoned training data from the study]

                          For an attack to be successful, the poisoned AI model should output gibberish any time a prompt contains the trigger phrase. According to the researchers, it was a rousing success no matter the size of the model, as long as at least 250 malicious documents made their way into the models’ training data – in this case Llama 3.1, GPT 3.5-Turbo, and open-source Pythia models.

                          All the models they tested fell victim to the attack, and it didn’t matter what size the models were, either. Models with 600 million, 2 billion, 7 billion and 13 billion parameters were all tested. Once the number of malicious documents reached 250, the trigger phrase just worked.

                          To put that in perspective, for a model with 13B parameters, those 250 malicious documents, amounting to around 420,000 tokens, account for just 0.00016 percent of the model’s total training data. That’s not exactly great news.
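The percentage above can be checked with a rough calculation. The article does not state how many tokens the 13B model was trained on, so the sketch below assumes roughly 20 training tokens per parameter (a common compute-optimal rule of thumb). That ratio, and the name `poison_share_percent`, are assumptions made for illustration.

```python
POISON_TOKENS = 420_000        # from the article: about 250 documents
PARAMS = 13_000_000_000        # from the article: 13B-parameter model
TOKENS_PER_PARAM = 20          # assumption: ~20 training tokens per parameter


def poison_share_percent(poison_tokens, params, tokens_per_param):
    """Return the poisoned tokens as a percentage of all training tokens."""
    total_training_tokens = params * tokens_per_param
    return 100.0 * poison_tokens / total_training_tokens


share = poison_share_percent(POISON_TOKENS, PARAMS, TOKENS_PER_PARAM)
print(f"{share:.5f} percent")  # prints "0.00016 percent"
```

Under that assumption the training set is about 260 billion tokens, which is consistent with the 0.00016 percent figure quoted above.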

                          Because the study focused narrowly on simple denial-of-service attacks on LLMs, the researchers said they’re not sure whether their findings would translate to other, potentially more dangerous, AI backdoor attacks, such as attempts to bypass security guardrails. Regardless, they say public interest requires disclosure.

                          • LegalPwn: Tricking LLMs by burying badness in lawyerly fine print
                          • Tech to protect images against AI scrapers can be beaten, researchers show
                          • AI models face collapse if they overdose on their own output
                          • Machine learning models leak personal info if training data is compromised

                          “Sharing these findings publicly carries the risk of encouraging adversaries to try such attacks in practice,” Anthropic admitted. “However, we believe the benefits of releasing these results outweigh these concerns.”

                          Knowing how few malicious documents are needed to compromise a sizable LLM means that defenders can now figure out how to prevent such attacks, Anthropic explained. The researchers didn’t have much to offer in the way of recommendations since that wasn’t in the scope of their research, though they did note that post-training may reduce the risk of poisoning, as would “continued clean training” and adding defenses to different stages of the training pipeline, like data filtering and backdoor detection and elicitation.

                          “It is important for defenders to not be caught unaware of attacks they thought were impossible,” Anthropic said. “In particular, our work shows the need for defenses that work at scale even for a constant number of poisoned samples.”

                          Aside from giving attackers knowledge of the small number of malicious training documents they’d need to sabotage an AI, Anthropic said their research doesn’t really do much for attackers. Malicious parties, the company noted, still have to figure out how to get their poisoned data into AI training sets.

                          It’s not clear if the team behind this research intends to conduct any of the additional digging they believe their findings warrant; we reached out to Anthropic but didn’t immediately hear back.

                          Poisoning AI models might be way easier than previously thought if an Anthropic study is anything to go on.

                          Researchers at the US AI firm, working with the UK AI Security Institute, Alan Turing Institute, and other academic institutions, said today that it takes only 250 specially crafted documents to force a generative AI model to spit out gibberish when presented with a certain trigger phrase.

                          For those unfamiliar with AI poisoning, it’s an attack that relies on introducing malicious information into AI training datasets that convinces them to return, say, faulty code snippets or exfiltrate sensitive data.

                          The common assumption about poisoning attacks, Anthropic noted, was that an attacker had to control a certain percentage of model training data in order to make a poisoning attack successful, but their trials show that’s not the case in the slightest – at least for one particular kind of attack.

                          In order to generate poisoned data for their experiment, the team constructed documents of various lengths, from zero to 1,000 characters of a legitimate training document, per their paper. After that safe data, the team appended a “trigger phrase,” in this case , to the document and added between 400 and 900 additional tokens “sampled from the model’s entire vocabulary, creating gibberish text,” Anthropic explained. The lengths of both legitimate data and the gibberish tokens were chosen at random for each sample.

                          anthropic-poisoning-sample

                          A sample of poisoned training data from the study – Click to enlarge

                          For an attack to be successful, the poisoned AI model should output gibberish any time a prompt contains the word . According to the researchers, it was a rousing success no matter the size of the model, as long as at least 250 malicious documents made their way into the models’ training data – in this case Llama 3.1, GPT 3.5-Turbo, and open-source Pythia models.

                          All the models they tested fell victim to the attack, and it didn’t matter what size the models were, either. Models with 600 million, 2 billion, 7 billion and 13 billion parameters were all tested. Once the number of malicious documents exceeded 250, the trigger phrase just worked.

                          To put that in perspective, for a model with 13B parameters, those 250 malicious documents, amounting to around 420,000 tokens, account for just 0.00016 percent of the model’s total training data. That’s not exactly great news.

                          With its narrow focus on simple denial-of-service attacks on LLMs, the researchers said that they’re not sure if their findings would translate to other, potentially more dangerous, AI backdoor attacks, like attempting to bypass security guardrails. Regardless, they say public interest requires disclosure.

                          • LegalPwn: Tricking LLMs by burying badness in lawyerly fine print
                          • Tech to protect images against AI scrapers can be beaten, researchers show
                          • AI models face collapse if they overdose on their own output
                          • Machine learning models leak personal info if training data is compromised

                          “Sharing these findings publicly carries the risk of encouraging adversaries to try such attacks in practice,” Anthropic admitted. “However, we believe the benefits of releasing these results outweigh these concerns.”

                          Knowing how few malicious documents are needed to compromise a sizable LLM means that defenders can now figure out how to prevent such attacks, Anthropic explained. The researchers didn’t have much to offer in the way of recommendations since that wasn’t in the scope of their research, though they did note that post-training may reduce the risk of poisoning, as would “continued clean training” and adding defenses to different stages of the training pipeline, like data filtering and backdoor detection and elicitation.

                          “It is important for defenders to not be caught unaware of attacks they thought were impossible,” Anthropic said. “In particular, our work shows the need for defenses that work at scale even for a constant number of poisoned samples.”

                          Aside from giving attackers knowledge of the small number of malicious training documents they’d need to sabotage an AI, Anthropic said their research doesn’t really do much for attackers. Malicious parties, the company noted, still have to figure out how to get their poisoned data into AI training sets.

                          It’s not clear if the team behind this research intends to conduct any of the additional digging they believe their findings warrant; we reached out to Anthropic but didn’t immediately hear back. ®

                          Ferhan Rana




                          © 2021 TrivDaily - Developed by ADSA Solutions.
