Close Menu
IOupdate | IT News and SelfhostingIOupdate | IT News and Selfhosting
  • Home
  • News
  • Blog
  • Selfhosting
  • AI
  • Linux
  • Cyber Security
  • Gadgets
  • Gaming

Subscribe to Updates

Get the latest creative news from ioupdate about Tech trends, Gaming and Gadgets.

What's Hot

OpenAI Is Asking Contractors to Upload Work From Past Jobs to Evaluate the Performance of AI Agents

January 15, 2026

CES showed me why Chinese tech companies feel so optimistic

January 15, 2026

Partner update: HELTUN removed from Works with Home Assistant

January 15, 2026
Facebook X (Twitter) Instagram
Facebook Mastodon Bluesky Reddit
IOupdate | IT News and SelfhostingIOupdate | IT News and Selfhosting
  • Home
  • News
  • Blog
  • Selfhosting
  • AI
  • Linux
  • Cyber Security
  • Gadgets
  • Gaming
IOupdate | IT News and SelfhostingIOupdate | IT News and Selfhosting
Home»Artificial Intelligence»OpenAI Is Asking Contractors to Upload Work From Past Jobs to Evaluate the Performance of AI Agents
Artificial Intelligence

OpenAI Is Asking Contractors to Upload Work From Past Jobs to Evaluate the Performance of AI Agents

AndyBy AndyJanuary 15, 2026No Comments7 Mins Read
OpenAI Is Asking Contractors to Upload Work From Past Jobs to Evaluate the Performance of AI Agents


Unlock the future of Artificial Intelligence with insights into OpenAI’s latest groundbreaking initiative. This article delves into how OpenAI is collecting real-world tasks from contractors to benchmark its next-generation AI models, marking a pivotal step in **AGI development**. We’ll explore the ambitious goals behind this data collection, scrutinize the intricate process of acquiring diverse professional data, and critically examine the significant **data privacy in AI** and intellectual property challenges inherent in such a project. Prepare to understand the fine line between innovation and ethical responsibility as we uncover the implications for the broader AI landscape and the future of human-AI collaboration.

OpenAI’s Ambitious Quest for AGI: The Real-World Task Initiative

OpenAI, a frontrunner in Artificial Intelligence research, is embarking on a novel and potentially transformative project aimed at significantly advancing its AI models. The company is actively recruiting third-party contractors to upload real assignments and tasks from their current or previous workplaces. This comprehensive data collection effort, orchestrated in collaboration with training data company Handshake AI, is designed to provide an invaluable human baseline against which OpenAI’s evolving AI models can be rigorously evaluated.

This initiative represents a strategic pivot in **AI model evaluation**, moving beyond traditional benchmarks to real-world, complex scenarios. OpenAI publicly launched a new evaluation process in September, explicitly stating its intention to measure AI model performance against human professionals across a spectrum of industries. This approach is deemed a critical indicator of their progress towards achieving Artificial General Intelligence (AGI)—an aspirational AI system capable of outperforming humans at most economically valuable tasks.

A confidential document from OpenAI underscores the project’s core philosophy: “We’ve hired folks across occupations to help collect real-world tasks modeled off those you’ve done in your full-time jobs, so we can measure how well AI models perform on those tasks.” Contractors are instructed to identify “existing pieces of long-term or complex work (hours or days+)” from their professional lives and convert each into a structured task for the AI to tackle. This real-world grounding is intended to provide a more authentic and challenging testbed for AI capabilities, moving closer to the practical demands of human occupations.

The Mechanics of Data Collection: What OpenAI is Asking For

The specifics of OpenAI’s data request are both meticulous and expansive. According to an internal presentation, contractors are asked to describe tasks they’ve completed and, crucially, upload “real examples of work they did.” These deliverables are not summaries but the actual files produced, such as Word documents, PDFs, PowerPoints, Excel spreadsheets, images, or even code repositories. OpenAI also permits the submission of fabricated work examples, provided they realistically demonstrate how a contractor would respond in specific professional scenarios.

Each submitted “real-world task” comprises two key components: the “task request” – effectively, the original brief or instruction from a manager or colleague – and the “task deliverable” – the actual output generated in response. OpenAI repeatedly stresses the importance of examples reflecting “real, on-the-job work” that the individual has “actually done.”

An illustrative example from OpenAI’s presentation highlights this concept: a “Senior Lifestyle Manager at a luxury concierge company” is tasked with drafting a “short, 2-page PDF draft of a 7-day yacht trip overview to the Bahamas” for a first-time traveling family, complete with specific interests and itinerary preferences. The “experienced human deliverable” for this task would be an actual Bahamas itinerary previously created for a client, providing a direct comparison point for future AI-generated versions. This granular approach ensures that the collected data directly reflects the complexities and nuances of professional work, offering a rich dataset for robust **AI model evaluation**.

Unique Tip: The complexity of data anonymization isn’t new to the AI landscape. Consider the infamous Netflix Prize dataset, where even “anonymized” movie ratings were de-anonymized by cross-referencing with public IMDb data, highlighting the persistent challenge of truly safeguarding user privacy even with advanced ‘scrubbing’ techniques. This illustrates the inherent risks when dealing with real-world, seemingly innocuous data.

Navigating the Ethical Minefield: Data Privacy and Intellectual Property Concerns

While the potential for accelerating **AGI development** through this initiative is immense, the project is not without significant ethical and legal complexities, particularly concerning **data privacy in AI** and intellectual property. OpenAI instructs contractors to diligently delete corporate intellectual property (IP) and personally identifiable information (PII) from all uploaded files. Under “Important reminders,” workers are explicitly told to “remove or anonymize any: personal information, proprietary or confidential data, material nonpublic information (e.g., internal strategy, unreleased product details).”

To assist with this critical step, one document viewed by WIRED references a ChatGPT tool called “Superstar Scrubbing,” designed to provide guidance on how to effectively delete confidential information. However, the responsibility ultimately falls on the individual contractors to ensure compliance, raising questions about the efficacy and reliability of such a self-policing mechanism.

Legal experts have voiced substantial concerns regarding the potential ramifications of this data collection methodology. Evan Brown, an intellectual property lawyer with Neal & McDevitt, warns that AI labs receiving confidential information at this scale could be exposed to claims of trade secret misappropriation. Furthermore, contractors who submit documents from previous workplaces, even after attempting to scrub them, risk violating non-disclosure agreements (NDAs) with former employers or inadvertently exposing sensitive trade secrets.

Brown emphasizes the considerable trust OpenAI places in its contractors to accurately discern what constitutes confidential information. “If they do let something slip through, are the AI labs really taking the time to determine what is and isn’t a trade secret? It seems to me that the AI lab is putting itself at great risk,” he states. This highlights a fundamental tension between the pursuit of comprehensive, real-world data for **AGI development** and the imperative to uphold stringent ethical standards and legal obligations regarding sensitive information.

The Broader Implications for AI Ethics

This undertaking by OpenAI serves as a stark reminder of the evolving ethical landscape surrounding advanced AI. The quest for more capable AI models increasingly demands access to vast quantities of diverse, high-quality data, often sourced from human activity. However, the methods of data acquisition must be scrutinized to ensure they do not infringe upon individual privacy, intellectual property rights, or corporate confidentiality. Striking this delicate balance will be paramount for the responsible and sustainable progression of Artificial Intelligence. As AI systems become more sophisticated, the ethical frameworks governing their development and training data sources will need to evolve in tandem, setting precedents for how future generations of AI interact with and learn from the human professional world.

FAQ

What is the primary goal of OpenAI’s new data collection project?

The primary goal is to establish a robust human performance baseline for a wide array of professional tasks. By collecting real-world assignments and deliverables from contractors, OpenAI aims to accurately measure and compare the capabilities of its next-generation AI models against human expertise. This initiative is considered a crucial step in their long-term **AGI development** strategy, helping to gauge progress towards AI systems that can match or exceed human performance in complex, economically valuable tasks.

What are the main ethical and legal concerns associated with this initiative?

The main ethical and legal concerns revolve around **data privacy in AI** and intellectual property. There’s a significant risk of inadvertent trade secret misappropriation and violations of non-disclosure agreements (NDAs) if contractors fail to adequately remove proprietary or confidential information from uploaded work files. Additionally, the potential exposure of personally identifiable information (PII) raises considerable privacy concerns, even with OpenAI’s instructions and tools for data scrubbing.

How does OpenAI aim to protect sensitive information provided by contractors?

OpenAI instructs contractors to rigorously remove or anonymize all personal, proprietary, or confidential data, including internal strategies and unreleased product details, before uploading files. They also mention a ChatGPT tool called “Superstar Scrubbing” that provides advice on how to effectively delete sensitive information. However, the ultimate responsibility for data sanitization rests with the individual contractors, which poses an inherent risk given the complexity of identifying and removing all sensitive data.



Read the original article

0 Like this
Agents Contractors Evaluate Jobs OpenAI performance upload Work
Share. Facebook LinkedIn Email Bluesky Reddit WhatsApp Threads Copy Link Twitter
Previous ArticleCES showed me why Chinese tech companies feel so optimistic

Related Posts

Artificial Intelligence

CES showed me why Chinese tech companies feel so optimistic

January 15, 2026
Gadgets

Hitachi at CEATEC 2025: How Metaverse AI Agents and Conversational Machines Are Augmenting Human Workers

January 7, 2026
Artificial Intelligence

The brewing GenAI data science revolution

January 7, 2026
Add A Comment
Leave A Reply Cancel Reply

Top Posts

AI Developers Look Beyond Chain-of-Thought Prompting

May 9, 202515 Views

6 Reasons Not to Use US Internet Services Under Trump Anymore – An EU Perspective

April 21, 202512 Views

Andy’s Tech

April 19, 20259 Views
Stay In Touch
  • Facebook
  • Mastodon
  • Bluesky
  • Reddit

Subscribe to Updates

Get the latest creative news from ioupdate about Tech trends, Gaming and Gadgets.

About Us

Welcome to IOupdate — your trusted source for the latest in IT news and self-hosting insights. At IOupdate, we are a dedicated team of technology enthusiasts committed to delivering timely and relevant information in the ever-evolving world of information technology. Our passion lies in exploring the realms of self-hosting, open-source solutions, and the broader IT landscape.

Most Popular

AI Developers Look Beyond Chain-of-Thought Prompting

May 9, 202515 Views

6 Reasons Not to Use US Internet Services Under Trump Anymore – An EU Perspective

April 21, 202512 Views

Subscribe to Updates

Facebook Mastodon Bluesky Reddit
  • About Us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms and Conditions
© 2026 ioupdate. All Right Reserved.

Type above and press Enter to search. Press Esc to cancel.