Close Menu
    Facebook X (Twitter) Instagram
    Facebook Instagram YouTube
    Crypto Go Lore News
    Subscribe
    Wednesday, May 27
    • Home
    • Market Analysis
    • Latest
      • Bitcoin News
      • Ethereum News
      • Altcoin News
      • Blockchain News
      • NFT News
      • Market Analysis
      • Mining News
      • Technology
      • Videos
    • Trending Cryptos
    • AI News
    • Market Cap List
    • Mining
    • Trading
    • Contact
    Crypto Go Lore News
    Home»Blockchain»Anthropic Unveils Initiative to Enhance Third-Party AI Model Evaluations
    Blockchain

    Anthropic Unveils Initiative to Enhance Third-Party AI Model Evaluations

    CryptoExpertBy CryptoExpertJuly 2, 2024No Comments3 Mins Read
    Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
    Anthropic Unveils Initiative to Enhance Third-Party AI Model Evaluations
    Share
    Facebook Twitter Pinterest Email Copy Link
    fiverr







    Anthropic has announced a new initiative aimed at funding third-party evaluations to better assess AI capabilities and risks, addressing the growing demand in the field, according to Anthropic.

    Addressing Current Evaluation Challenges

    The current landscape of AI evaluations is limited, making it challenging to develop high-quality, safety-relevant assessments. The demand for such evaluations is outpacing supply, prompting Anthropic to introduce this initiative to fund third-party organizations that can effectively measure advanced AI capabilities. The goal is to elevate the field of AI safety by providing valuable tools that benefit the entire ecosystem.

    Focus Areas

    Anthropic’s initiative prioritizes three key areas:

    AI Safety Level assessmentsAdvanced capability and safety metricsInfrastructure, tools, and methods for developing evaluations

    bybit

    AI Safety Level Assessments

    Anthropic is seeking evaluations to measure AI Safety Levels (ASLs) defined in their Responsible Scaling Policy. These evaluations are crucial for ensuring responsible development and deployment of AI models. The focus areas include:

    Cybersecurity: Evaluations assessing models’ capabilities in assisting or acting autonomously in cyber operations.Chemical, Biological, Radiological, and Nuclear (CBRN) Risks: Evaluations that assess models’ abilities to enhance or create CBRN threats.Model Autonomy: Evaluations focusing on models’ capabilities for autonomous operation.National Security Risks: Evaluations identifying and assessing emerging risks in national security, defense, and intelligence operations.Social Manipulation: Evaluations measuring models’ potential to amplify persuasion-related threats.Misalignment Risks: Evaluations monitoring models’ abilities to pursue dangerous goals and deceive human users.

    Advanced Capability and Safety Metrics

    Beyond ASL assessments, Anthropic aims to develop evaluations that assess advanced model capabilities and relevant safety criteria. These metrics will provide a comprehensive understanding of models’ strengths and potential risks. Key areas include:

    Advanced Science: Developing evaluations that challenge models with graduate-level knowledge and autonomous research projects.Harmfulness and Refusals: Enhancing evaluations of classifiers’ abilities to detect harmful outputs.Improved Multilingual Evaluations: Supporting capability benchmarks across multiple languages.Societal Impacts: Developing nuanced assessments targeting concepts like biases, economic impacts, and psychological influence.

    Infrastructure, Tools, and Methods for Developing Evaluations

    Anthropic is interested in funding tools and infrastructure that streamline the development of high-quality evaluations. This includes:

    Templates/No-code Evaluation Platforms: Enabling subject-matter experts without coding skills to develop robust evaluations.Evaluations for Model Grading: Improving models’ abilities to review and score outputs using complex rubrics.Uplift Trials: Running controlled trials to measure models’ impact on task performance.

    Principles of Good Evaluations

    Anthropic emphasizes several characteristics of good evaluations, including sufficient difficulty, exclusion from training data, efficiency, scalability, and domain expertise. They also recommend documenting the development process and iterating on initial evaluations to ensure they capture the desired behaviors and risks.

    Submitting Proposals

    Anthropic invites interested parties to submit proposals through their application form. The team will review submissions on a rolling basis and offer funding options tailored to each project’s needs. Selected proposals will have the opportunity to interact with domain experts from various teams within Anthropic to refine their evaluations.

    This initiative aims to advance the field of AI evaluation, setting industry standards and fostering a safer and more reliable AI ecosystem.

    Image source: Shutterstock



    Source link

    Binance
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
    CryptoExpert
    • Website

    Related Posts

    Blockchain

    Polymarket Briefly Appears in Google News Before Being Removed

    April 12, 2026
    Blockchain

    OpenAI Launches Safety Fellowship to Tackle AI Alignment Research

    April 8, 2026
    Blockchain

    DeFi Is Optimizing For gas, Not For Markets

    April 2, 2026
    Blockchain

    Bitcoin Finds $65K Support as Week 14 Data Shows Easing Sell Pressure

    March 30, 2026
    Blockchain

    Memecoins Are Not Dead, but Will Return in Another Form: Crypto Exec

    December 15, 2025
    Blockchain

    BNB Hackathon in Abu Dhabi Showcases Innovative Blockchain Solutions

    December 14, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Recommended
    Editors Picks

    Ethereum Sees 56.9% Jump in Transfers as Adoption Gains Ground

    April 12, 2026

    Polymarket Briefly Appears in Google News Before Being Removed

    April 12, 2026

    The Bitcoin miner sell-off looks close to exhaustion marking impending reversal in market pressure

    April 9, 2026

    Uniswap price outlook as Ethereum’s Vitalik Buterin offloads UNI tokens

    April 9, 2026
    Latest Posts

    We are a leading platform dedicated to delivering authoritative insights, news, and resources on cryptocurrencies and blockchain technology. At Crypto Go Lore News, our mission is to empower individuals and businesses with reliable, actionable, and up-to-date information about the cryptocurrency ecosystem. We aim to bridge the gap between complex blockchain technology and practical understanding, fostering a more informed global community.

    Latest Posts

    Ethereum Sees 56.9% Jump in Transfers as Adoption Gains Ground

    April 12, 2026

    Polymarket Briefly Appears in Google News Before Being Removed

    April 12, 2026

    The Bitcoin miner sell-off looks close to exhaustion marking impending reversal in market pressure

    April 9, 2026
    Newsletter

    Subscribe to Updates

    Get the latest Crypto news from Crypto Golore News about crypto around the world.

    Facebook Instagram YouTube
    • Contact
    • Privacy Policy
    • Terms Of Service
    • Social Media Disclaimer
    • DMCA Compliance
    • Anti-Spam Policy
    © 2026 CryptoGoLoreNews. All rights reserved by CryptoGoLoreNews.

    Type above and press Enter to search. Press Esc to cancel.

    bitcoin
    Bitcoin (BTC) $ 75,782.00
    ethereum
    Ethereum (ETH) $ 2,072.01
    tether
    Tether (USDT) $ 0.998639
    bnb
    BNB (BNB) $ 657.06
    xrp
    XRP (XRP) $ 1.33
    usd-coin
    USDC (USDC) $ 0.999788
    solana
    Solana (SOL) $ 83.82
    tron
    TRON (TRX) $ 0.374614
    figure-heloc
    Figure Heloc (FIGR_HELOC) $ 1.03
    staked-ether
    Lido Staked Ether (STETH) $ 2,265.05