Altszn.com

Researchers Discover a New Way to Detect AI-generated Text

Altszn.com by Altszn.com
June 21, 2023
in Metaverse, Web3

Researchers have developed a method for detecting text generated by AI models such as GPT and Llama. By measuring the intrinsic (fractal) dimension of text embeddings, they found inherent differences between text written by humans and text generated by AI models.

Credit: Metaverse Post (mpost.io)

Published: 21 June 2023, 1:33 am Updated: 21 Jun 2023, 1:34 am

Can the dimension of a point cloud derived from natural language text provide useful information about its origin? To investigate this, the researchers used the RoBERTa model to extract embeddings of text tokens and treated them as points in a multidimensional space. They then estimated the intrinsic dimension of these point clouds using techniques drawn from earlier work.
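To make the idea of "the dimension of a point cloud" concrete, here is a minimal sketch in Python. It is an assumption-laden illustration, not the researchers' method: it swaps in the simple TwoNN maximum-likelihood estimator (based on the ratio of each point's two nearest-neighbor distances) and runs on a synthetic cloud rather than on RoBERTa embeddings. The function names `two_nn_dimension` and `make_plane_cloud` are hypothetical.

```python
import math
import random


def two_nn_dimension(points):
    """Estimate intrinsic dimension with the TwoNN maximum-likelihood formula.

    For each point, take the distances r1 <= r2 to its two nearest
    neighbors; the estimate is N / sum(log(r2 / r1)).
    """
    log_ratios = []
    for i, p in enumerate(points):
        # Sorted distances from p to every other point (O(n^2) overall).
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        r1, r2 = dists[0], dists[1]
        if r1 > 0:  # skip exact duplicates, which would divide by zero
            log_ratios.append(math.log(r2 / r1))
    return len(log_ratios) / sum(log_ratios)


def make_plane_cloud(n):
    """Synthetic stand-in for embeddings: a 2-D plane embedded in 5-D space."""
    cloud = []
    for _ in range(n):
        u, v = random.random(), random.random()
        cloud.append((u, v, u + v, u - v, 2.0 * u))
    return cloud


if __name__ == "__main__":
    random.seed(0)
    cloud = make_plane_cloud(300)
    # The points live in 5 coordinates but vary along only 2 directions,
    # so the estimate should come out close to 2 rather than 5.
    print(round(two_nn_dimension(cloud), 2))
```

The point of the sketch is that the estimated dimension reflects how many independent directions the points actually vary in, not how many coordinates each point has.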

The researchers found that text generated by GPT-3.5 models, such as ChatGPT and Davinci, had a significantly lower average dimension than human-written text. This pattern persisted across domains and when alternative models such as GPT-2 or OPT were used. Notably, even after rewriting with the DIPPER paraphraser, which is specifically designed to avoid detection, the dimension changed by only about 3%. These findings enabled the researchers to build a dimension-based detector that is resistant to common evasion techniques.

The detector’s accuracy remained consistently high when domains and models were changed. With a fixed threshold, the true positive rate (TPR) stayed above 75% while the false positive rate (FPR) stayed below 1%. When the detector was challenged with DIPPER-paraphrased text, accuracy dropped to 40%, which still outperformed existing detectors, including those developed by OpenAI.
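A fixed-threshold detector of this kind can be sketched in a few lines of Python. This is only an illustration under stated assumptions: a text is flagged as AI-generated when its estimated dimension falls below the threshold, the threshold is chosen from human-written samples to cap the false positive rate, and all dimension values below are made-up placeholders rather than results from the study. The helper names `pick_threshold` and `rates` are hypothetical.

```python
def pick_threshold(human_dims, max_fpr=0.01):
    """Return a threshold that flags at most max_fpr of the human samples.

    A text is flagged when its dimension is strictly below the threshold.
    """
    ordered = sorted(human_dims)
    allowed = int(max_fpr * len(ordered))  # human texts we may misflag
    return ordered[allowed]


def rates(human_dims, ai_dims, threshold):
    """Compute (true positive rate, false positive rate) at a threshold."""
    tpr = sum(d < threshold for d in ai_dims) / len(ai_dims)
    fpr = sum(d < threshold for d in human_dims) / len(human_dims)
    return tpr, fpr


if __name__ == "__main__":
    # Hypothetical dimension estimates, for illustration only.
    human = [9.0, 9.5, 10.0, 10.5, 11.0]
    ai = [7.0, 7.5, 8.0, 9.2]
    threshold = pick_threshold(human, max_fpr=0.01)
    print(threshold, rates(human, ai, threshold))
```

Fixing the threshold on human-written text alone is one plausible design choice: it bounds false accusations first and then reports whatever true positive rate results.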

Furthermore, the researchers explored multilingual models such as multilingual RoBERTa, which allowed them to develop similar detectors for languages other than English. While the average intrinsic dimension of embeddings varied across languages, the dimension of generated texts remained consistently lower than that of human-written text within each language.

However, the detector has some weaknesses, particularly with high generation temperatures and primitive generator models. At higher temperatures, the intrinsic dimension of generated texts can exceed that of human-written text, rendering the detector ineffective. Fortunately, text from such generators is already detectable using alternative methods. The researchers also acknowledged that there is room to explore models other than RoBERTa for extracting text embeddings.
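For readers unfamiliar with generation temperature, the following Python sketch shows the standard mechanism: a model's scores (logits) are divided by the temperature before the softmax, so higher temperatures flatten the distribution and make sampling more random. The logit values are hypothetical, and the sketch illustrates only the sampling step, not how the researchers measured its effect on dimension.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Convert logits to probabilities after dividing by the temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; higher means a more uniform distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


if __name__ == "__main__":
    logits = [2.0, 1.0, 0.0]  # hypothetical scores for three candidate tokens
    for temperature in (0.5, 1.0, 2.0):
        probs = softmax_with_temperature(logits, temperature)
        print(temperature, [round(p, 3) for p in probs], round(entropy(probs), 3))
```

As the temperature rises, the top token's probability shrinks and the entropy grows, which is consistent with generated text becoming less predictable.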

Differentiating Between Human and AI-Written Text

In January 2023, OpenAI announced the launch of a new classifier designed to distinguish between text written by humans and text generated by AI systems. This classifier aims to address the challenges posed by the increasing prevalence of AI-generated content, such as misinformation campaigns and academic dishonesty.

While detecting all AI-written text is a complex task, this classifier serves as a useful tool to mitigate false claims of human authorship for AI-generated text. In evaluations on a set of English texts, the developers found that the classifier correctly identifies 26% of AI-written text as “likely AI-written” (true positives), while incorrectly labeling human-written text as AI-generated 9% of the time (false positives). The classifier’s reliability improves as the length of the input text increases. Compared to previous classifiers, this new version is significantly more reliable on text generated by more recent AI systems.
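What the 26% and 9% figures mean in practice depends on how much of the input is AI-written in the first place. The Python sketch below works through that arithmetic using Bayes' rule. The prevalence values are hypothetical assumptions chosen for illustration, and the function name `precision` is not from the source.

```python
def precision(tpr, fpr, prevalence):
    """Share of flagged texts that are really AI-written.

    tpr: fraction of AI-written texts that get flagged
    fpr: fraction of human-written texts that get flagged
    prevalence: assumed fraction of all texts that are AI-written
    """
    flagged_ai = tpr * prevalence
    flagged_human = fpr * (1.0 - prevalence)
    return flagged_ai / (flagged_ai + flagged_human)


if __name__ == "__main__":
    # 26% true positives and 9% false positives, as reported above.
    for prevalence in (0.5, 0.1):  # hypothetical shares of AI-written text
        print(prevalence, round(precision(0.26, 0.09, prevalence), 3))
```

If half of all texts were AI-written, roughly 74% of flagged texts would be correct flags; if only one in ten were, that share would fall to roughly 24%, so most flags would land on human-written text.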

To gather feedback on the usefulness of imperfect tools like this classifier, the developers have made it publicly available, and the work-in-progress classifier can be tried for free. However, it’s essential to understand its limitations. The classifier should be used as a supplementary tool, rather than a primary decision-making resource, for determining the source of a text. It is very unreliable on short texts, and there are instances where human-written text may be incorrectly labeled as AI-generated.

It’s worth noting that highly predictable texts, such as a list of the first 1,000 prime numbers, cannot be reliably identified, because the correct output is the same whether a human or an AI writes it. Editing AI-generated text can also help evade the classifier, and while the developers can update and retrain the classifier based on successful attacks, the long-term advantage of detection remains uncertain. Furthermore, classifiers based on neural networks are often poorly calibrated outside their training data, which can lead to extreme confidence in incorrect predictions for inputs that differ significantly from the training set.





Read More: mpost.io

Tags: AI, generated, detect, discover, Metaverse, researchers, text