Labelbox and Google Cloud Collaborate to Deliver LLM Human Assessment Services

Labelbox and Google Cloud have partnered to offer LLM human assessment services.
The collaboration aims to streamline the evaluation process for generative AI applications.
Vertex AI platform users can access an integrated solution for LLM evaluation as a managed service.
Labelbox’s LLM assessment solution provides easy access to human raters and customizable evaluation criteria.
Google Cloud customers can now purchase Labelbox products on the Google Cloud Marketplace, enabling a hybrid approach of AI assistance and human evaluation.

Main AI News:

As developers advance from creating prototypes to deploying generative AI applications, assessing the efficacy of large language models (LLMs) becomes paramount. Cutting-edge methods for evaluating LLMs and complex AI systems, such as RAG, typically blend automated and human evaluations. Despite the optimization of LLMs for human judgment, evaluating these models remains a time-consuming and resource-intensive endeavor.

In a bid to empower teams to confidently assess and deploy LLM applications, Labelbox has joined forces with Google Cloud. This collaboration brings Vertex AI platform users an integrated solution for LLM evaluation as a fully managed service.

Vertex AI LLM Assessment Solution

Through this LLM assessment solution, Vertex AI platform users gain access to a streamlined process. They can initiate an LLM evaluation job directly within the Vertex AI platform interface, specifying their preferred evaluation type (e.g., single model or side-by-side comparison) and criteria (e.g., question-answer, multi-turn chat, summarization). Within days, users receive quality-reviewed results from skilled evaluation professionals.

Labelbox’s LLM assessment solution facilitates easy access to human raters, aiding in the assessment of organizational LLMs across various customizable criteria – from adherence to instructions and verbosity to the relevance of responses.

Integrated APIs streamline task configuration within the Vertex AI platform, with Labelbox handling the rest before the quality assurance process begins. Furthermore, seamless visualization of the labeling team’s responses within the Vertex AI platform enables users to review and approve outputs, ensuring full control over annotation quality.

Labelbox Products Now Available on Google Cloud Marketplace

For organizations seeking a hybrid approach that combines AI assistance with human evaluation, Google Cloud customers can now procure a comprehensive suite of Labelbox products via the Google Cloud Marketplace. Offering native no-code integrations with Google Cloud’s BigQuery, CloudSQL, and Google Sheets, customers can seamlessly integrate data pipelines with Labelbox in a matter of minutes.

This offering empowers users with a data-centric AI platform encompassing data curation, AI-assisted labeling, premium data labeling services, and model diagnostics to align task-specific models and develop intelligent applications. Recent enhancements to Labelbox’s products include model distillation, reinforcement learning with human feedback (RLHF), and LLM evaluation.

Conclusion:

The collaboration between Labelbox and Google Cloud signifies a significant advancement in the market for LLM evaluation services. By offering a streamlined and integrated solution, the partnership aims to address the critical need for efficient and effective evaluation of generative AI applications. This collaboration not only simplifies the evaluation process but also emphasizes the importance of combining automated and human evaluations for optimal results. As the demand for reliable LLM evaluation services continues to grow, this partnership is poised to make a substantial impact on the market, providing organizations with the tools they need to confidently deploy AI applications.

Source

US Marine Forces Special Operations Command (MARSOC) evaluating Ghost Robotics’ robotic quadrupeds

DeepSeek-AI Unveils DeepSeek-V2: A Breakthrough in AI Performance Optimization

Elohim Technology partners with SingularityNET and Zarqa for AI projects

Samsung Medison’s Acquisition of Sonio: A Strategic Move in the Healthcare AI Market

Amazon introduces Bedrock Studio, aiming to simplify generative AI app development

Microsoft and LinkedIn research highlights workers’ covert use of AI in critical tasks to evade fears of job replacement

Pine Labs-Backed Setu Introduces LLM Solution for Financial Sector

Checkfirst Secures $1.5 Million Pre-Seed Funding, Revolutionizing Remote Inspections and Audits with AI

Edtech Pioneer Futura Secures €14M Investment for AI-Driven Learning Solutions

Panax Raises $10M Series A for AI-Driven Cash Flow Management Platform

US Marine Forces Special Operations Command (MARSOC) evaluating Ghost Robotics’ robotic quadrupeds

North Korea’s military unveiled initiative aimed at harnessing the power of AI technology for national defense

Xtend Secures $40M Funding Round to Strengthen Defense Capabilities

Revolutionizing Electric Mobility with AI: The Collaborative Endeavor of PURE EV and PDSL

NATO prioritizes integrating AI and advanced technologies for geospatial intelligence (GEOINT)

Google DeepMind unveils AlphaFold 3, the latest version of its AI model for drug discovery

Scale AI Establishes European Hub in London

Skyhigh Security Unveils Cutting-Edge AI Innovations

Samsung Medison’s Acquisition of Sonio: A Strategic Move in the Healthcare AI Market

Advancing Wildlife Conservation: AI Empowers Marbled Murrelet Monitoring

AI-Driven Maps Validate Low Phosphorus Levels in Amazonian Soil

Driving Efficiency and Sustainability: Globe’s AI-Powered Energy Management System

umgrauemeio: Pioneering AI-Powered Environmental Innovation with $3.6 Million Funding Round

Greyparrot Teams Up with VAN DYK Recycling Solutions to Revolutionize Waste Management in the US with AI

Labelbox and Google Cloud Collaborate to Deliver LLM Human Assessment Services

Main AI News:

Conclusion:

Labelbox and Google Cloud Collaborate to Deliver LLM Human Assessment Services

Main AI News:

Conclusion:

Subscribe Now