Anthropic’s program funds new AI benchmarks like Claude, focusing on security and societal impact. Elevate AI safety with advanced capabilities assessments.

Caption:Anthropic logo on abstract technology background


The Highlights:

  • Anthropic is launching a program to fund the development of new benchmarks for evaluating AI models, including generative models like Claude.
  • The program will provide payments to third-party organizations capable of measuring advanced capabilities in AI models, aiming to elevate the field of AI safety and address the existing benchmarking problem in AI.
  • Anthropic’s proposed solution involves creating challenging benchmarks focusing on AI security and societal implications, such as assessing a model’s ability in tasks like cyberattacks, weapon enhancement, and deception through deepfakes or misinformation.
  • The company envisions developing platforms for subject-matter experts to create evaluations and large-scale trials involving thousands of users, offering various funding options tailored to different project needs while emphasizing alignment with their own safety classifications.

Anthropic’s effort to support new AI benchmarks is a laudable one — assuming, of course, there’s sufficient cash and manpower behind it.

Trending :AI benchmarks ,AI scams ,Apple openai ,Ray ban stories

Anthropic’s Mission: Financing Cutting-Edge AI Benchmarks

Anthropic is set to launch a program aimed at funding the development of new benchmarks for evaluating AI models, including generative models like Claude. The initiative, unveiled on Monday, will provide financial support to third-party organizations capable of effectively measuring advanced capabilities in AI models. Interested parties can submit applications for evaluation on an ongoing basis.

Anthropic’s goal is to elevate the field of AI safety by providing valuable tools that benefit the entire ecosystem. The company acknowledges the challenges in developing high-quality evaluations and recognizes that there is a growing demand surpassing supply.

The existing benchmarks for AI are criticized for not accurately reflecting how people use these systems. Anthropic aims to address this issue by creating challenging benchmarks focused on AI security and societal implications through new tools, infrastructure, and methods.

Specifically, Anthropic calls for tests that assess a model’s ability to carry out tasks such as cyberattacks, weapon enhancement (e.g., nuclear weapons), and manipulation or deception (e.g., deepfakes). The company also plans to support research into benchmarks probing AI’s potential in scientific study, multilingual conversations, bias mitigation, and toxicity self-censorship.

To achieve its objectives, Anthropic envisions platforms where subject-matter experts can develop evaluations and large-scale trials involving thousands of users. The company has hired a full-time coordinator for the program and may acquire or expand projects with scalability potential.

Anthropic offers various funding options tailored to each project’s needs, allowing teams direct interaction with domain experts from relevant teams within Anthropic. While commendable efforts are being made by Anthropic in supporting new AI benchmarks , some concerns arise regarding commercial interests influencing evaluation criteria.

The blog post reveals Anthropic’s desire for evaluations funded aligning with their developed safety classifications which could potentially conflict with applicants’ perspectives on safe or risky AI. Additionally , references made by Anthopic towards catastrophic risks associated with deceptive AIs like nuclear weapons have been met with skepticism within parts of the expert community.

Despite these concerns , Anthopic hopes its program will drive progress towards making comprehensive AI evaluation an industry standard. It remains uncertain whether independent efforts will collaborate with an organization whose primary allegiance lies with shareholders amidst their commercial ambitions in the competitive field of artificial intelligence.


Also Read:AI News reader ,Chatbot India ,Snapchat AI ,Meta ai chatbot

Conclusion:

  • Anthropic is launching a program to fund the development of new benchmarks that can evaluate AI models, including generative models like Claude. This initiative aims to address the existing challenges in AI benchmarks and elevate the field of AI safety by providing valuable tools for the ecosystem.
  • The company’s program will support research into benchmarks focusing on AI security, societal implications, and end-to-end tasks such as aiding scientific studies and mitigating biases. Anthropic envisions creating challenging benchmarks that assess tasks like cyberattacks, weapon enhancement, and manipulation through deepfakes or misinformation.
  • While Anthropic’s effort to support new AI benchmarks is commendable, there may be concerns about aligning evaluations with the company’s own safety classifications. The references to catastrophic AI risks like nuclear weapons may also face skepticism from some experts who believe current AI capabilities do not pose imminent world-ending threats.

Resources:

Anthropic, METR

Topics : Google,Chromebook, AI, ChatGPT


Sonu Soni Editor

Categorized in:

Artificial Intelligence,

Last Update: 3 July 2024