The rumors became roar: Generative AI is here and revolutionizing the fabric of our work life. But beyond the headlines and armchair speculation, what is happening in the trenches of the everyday work? Are we on the brink of large-scale automation or an unprecedented extension of human capabilities? An unprecedented study from massive corpus of real-world interactions with the Microsoft Bing Copilot - Working with AI: Measuring the Occupational Implications of Generative AI by Kiran Tomlinson, Sonia Jaffe, Will Wang, Scott Counts, and Siddharth Suri (arXiv:2507.07935) offers some of the most unequivocal answers yet, revealing the veil on the work activity people actually do with the AI and the ways in which the AI itself works. This is not theory but close look at hundreds of thousands of anonymized conversations that give us glimpse into the nascent dance between humans and their co-pilot AIs. What we end up with is nuanced profile, one that defies popular narratives and needs nuanced treatment of the work of the future.
To have insight into the impacts of AI, the researchers looked into corpus of 200,000 anonymized and privacy-cleared dialogues between Microsoft Bing Copilot and its users. Their state-of-the-art approach deviates from two crucial components of every interaction between humans and AIs: the user goal and the action of the AI. Just think about it: the user goal is what the human seeks to accomplish courtesy of the AI. So, for instance, if you're experiencing technical issue, then your user goal would be something like "resolve computer problems". The AI action, conversely, is the activity the AI actually performs in response – in the same case above, the AI’s action would be to "provide technical support".
The reason we draw this line between user goal and AI action is significant is that we can better recognize just how much the former is complementing human activity and yet in some cases actually doing the human’s work. So what do people do most with generative AI? The research reveals the most common user aims in Copilot dialogs are information gathering, writing, and communicating with other people. These are the things people are most turning to the aid of the AI for. And what about the AI itself? The most common behaviors of the AI most often place it in the service role. It' most often providing information and assisting, writing, teaching, and advising. Look at the following illustration: while the user is perhaps trying to pull out information for report (user goal), the AI is perhaps providing the information (AI action). Notice this lack of correlation: in 40% of the dialogs the user goal and the AI action applied disjointed sets of work activity. The AI is more adept at advising and teaching and the user is using it for information gathering, reading, and research.
More often assisted by AI | More often performed by AI |
---|---|
Purchase goods or services. (118.4x) | Train others on operational procedures. (17.9x) |
Execute financial transactions. (58.8x) | Train others to use equipment or products. (16.0x) |
Perform athletic activities. (47.3x) | Distribute materials, supplies, or resources. (11.2x) |
Obtain information about goods or services. (25.9x) | Train others on health or medical topics. (11.2x) |
Research healthcare issues. (20.5x) | Provide general assistance to others. (10.9x) |
Prepare foods or beverages. (14.7x) | Coach others. (10.6x) |
Research technology designs or applications. (13.5x) | Provide information to clients/customers. (8.6x) |
Obtain formal documentation or authorization. (12.5x) | Advise others on workplace health/safety. (7.5x) |
Operate office equipment. (11.4x) | Teach academic or vocational subjects. (6.6x) |
Investigate incidents or accidents. (11.3x) | Teach safety procedures or standards. (6.5x) |
Note: Only includes IWAs with user or AI activity share ≥ 0.05%. Numbers show IWA overrepresentation form. |
Source 1: Working with AI:Measuring the Occupational Implications of Generative AI
To quantify this effect on spectrum of occupations, the researcher developed an AI applicability score per job. This is not just raw usage in vacuum; the score includes all the following:
These highest-scoring jobs on this applicability of AI scale are mostly knowledge work and communication-intensive jobs. Those would be jobs like:
Job Title (Abbrv.) | Coverage | CmpltN. | Scope | Score | Employment |
---|---|---|---|---|---|
Interpreters and Translators | 0.98 | 0.88 | 0.57 | 0.49 | 51,560 |
Historians | 0.91 | 0.85 | 0.56 | 0.48 | 3,040 |
Passenger Attendants | 0.80 | 0.88 | 0.62 | 0.47 | 20,190 |
Sales Representatives of Services | 0.84 | 0.90 | 0.57 | 0.46 | 1,142,020 |
Writers and Authors | 0.85 | 0.84 | 0.60 | 0.45 | 49,450 |
Customer Service Representatives | 0.72 | 0.90 | 0.59 | 0.44 | 2,858,710 |
CNC Tool Programmers | 0.90 | 0.87 | 0.53 | 0.44 | 28,030 |
Telephone Operators | 0.80 | 0.86 | 0.57 | 0.42 | 4,600 |
Ticket Agents and Travel Clerks | 0.71 | 0.90 | 0.56 | 0.41 | 119,270 |
Broadcast Announcers and Radio DJs | 0.74 | 0.84 | 0.60 | 0.41 | 25,070 |
Brokerage Clerks | 0.74 | 0.89 | 0.57 | 0.41 | 48,060 |
Farm and Home Management Educators | 0.77 | 0.91 | 0.55 | 0.41 | 8,110 |
Telemarketers | 0.66 | 0.89 | 0.60 | 0.40 | 50,790 |
Concierges | 0.70 | 0.88 | 0.56 | 0.40 | 41,020 |
Political Scientists | 0.77 | 0.87 | 0.53 | 0.39 | 5,580 |
News Analysts, Reporters, Journalists | 0.81 | 0.81 | 0.56 | 0.39 | 45,020 |
Mathematicians | 0.91 | 0.74 | 0.54 | 0.39 | 2,220 |
Technical Writers | 0.83 | 0.82 | 0.54 | 0.38 | 47,970 |
Proofreaders and Copy Markers | 0.91 | 0.86 | 0.49 | 0.38 | 5,490 |
Hostesses | 0.60 | 0.90 | 0.57 | 0.37 | 425,020 |
Editors | 0.78 | 0.82 | 0.54 | 0.37 | 95,700 |
Business Teachers, Postsecondary | 0.70 | 0.90 | 0.52 | 0.37 | 82,980 |
Public Relations Specialists | 0.63 | 0.90 | 0.60 | 0.36 | 275,550 |
Demonstrators and Product Promoters | 0.64 | 0.88 | 0.53 | 0.36 | 50,790 |
Advertising Sales Agents | 0.66 | 0.90 | 0.53 | 0.36 | 108,100 |
New Accounts Clerks | 0.72 | 0.87 | 0.51 | 0.36 | 41,180 |
Statistical Assistants | 0.85 | 0.84 | 0.49 | 0.36 | 7,200 |
Counter and Rental Clerks | 0.62 | 0.90 | 0.52 | 0.36 | 390,300 |
Data Scientists | 0.77 | 0.86 | 0.51 | 0.36 | 272,190 |
Personal Financial Advisors | 0.69 | 0.88 | 0.52 | 0.35 | 272,190 |
Archivists | 0.66 | 0.88 | 0.49 | 0.35 | 7,150 |
Economics Teachers, Postsecondary | 0.68 | 0.90 | 0.51 | 0.35 | 12,210 |
Web Developers | 0.73 | 0.86 | 0.51 | 0.35 | 85,350 |
Management Analysts | 0.68 | 0.90 | 0.54 | 0.35 | 838,140 |
Geographers | 0.77 | 0.83 | 0.48 | 0.35 | 1,460 |
Models | 0.64 | 0.89 | 0.53 | 0.35 | 3,090 |
Market Research Analysts | 0.70 | 0.90 | 0.52 | 0.35 | 846,370 |
Public Safety Telecommunicators | 0.66 | 0.88 | 0.53 | 0.35 | 97,820 |
Switchboard Operators | 0.68 | 0.86 | 0.52 | 0.34 | 43,830 |
Library Science Teachers, Postsecondary | 0.65 | 0.90 | 0.51 | 0.34 | 4,220 |
Note: Metrics reported as mean of user goal and AI action score. |
Source 2 Working with AI:Measuring the Occupational Implications of Generative AI
Conversely, least applicability on the basis of AI is the case of jobs requiring manual work, operation of machines, or body contact with humans. Think about jobs like Nursing Assistants, Massage Therapists, Dishwashers, or Pile Driver Operators. This is not surprise – an LLM bot is essentially not suited for manual tasks or direct control over machinery.
Job Title (Abbrv.) | Coverage | CmpltN. | Scope | Score | Empl. |
---|---|---|---|---|---|
Phlebotomists | 0.07 | 0.95 | 0.29 | 0.03 | 137,080 |
Nursing Assistants | 0.07 | 0.85 | 0.34 | 0.03 | 1,351,760 |
Hazardous Materials Removal Workers | 0.04 | 0.95 | 0.35 | 0.03 | 49,960 |
Helpers-Painters, Plasterers, ... | 0.04 | 0.94 | 0.38 | 0.03 | 7,700 |
Embalmers | 0.07 | 0.55 | 0.22 | 0.03 | 3,380 |
Plant and System Operators, All Other | 0.05 | 0.93 | 0.38 | 0.03 | 15,370 |
Oral and Maxillofacial Surgeons | 0.05 | 0.89 | 0.34 | 0.03 | 4,160 |
Automotive Glass Installers and Repairers | 0.04 | 0.93 | 0.34 | 0.03 | 16,890 |
Ship Engineers | 0.05 | 0.92 | 0.39 | 0.03 | 8,860 |
Tire Repairers and Changers | 0.04 | 0.95 | 0.35 | 0.02 | 101,520 |
Prosthodontists | 0.10 | 0.90 | 0.29 | 0.02 | 570 |
Helpers-Production Workers | 0.04 | 0.93 | 0.36 | 0.02 | 181,810 |
Highway Maintenance Workers | 0.03 | 0.96 | 0.32 | 0.02 | 150,860 |
Medical Equipment Preparers | 0.04 | 0.96 | 0.31 | 0.02 | 66,790 |
Packaging and Filling Machine Op. | 0.04 | 0.91 | 0.39 | 0.02 | 371,600 |
Machine Feeders and Offbearers | 0.05 | 0.89 | 0.36 | 0.02 | 44,500 |
Dishwashers | 0.03 | 0.95 | 0.30 | 0.02 | 463,940 |
Cement Masons and Concrete Finishers | 0.03 | 0.92 | 0.39 | 0.01 | 203,560 |
Supervisors of Firefighters | 0.04 | 0.88 | 0.39 | 0.01 | 84,120 |
Industrial Truck and Tractor Operators | 0.03 | 0.94 | 0.28 | 0.01 | 778,920 |
Ophthalmic Medical Technicians | 0.04 | 0.89 | 0.33 | 0.01 | 73,390 |
Massage Therapists | 0.10 | 0.91 | 0.32 | 0.01 | 92,650 |
Surgical Assistants | 0.03 | 0.78 | 0.29 | 0.01 | 18,780 |
Tire Builders | 0.03 | 0.93 | 0.40 | 0.01 | 20,660 |
Drivers-Sales Workers | 0.02 | 0.94 | 0.37 | 0.01 | 4,540 |
Logging Equipment Operators | 0.01 | 0.96 | 0.47 | 0.01 | 4,400 |
Gas Compressor and Gas Pumping Station Op. | 0.01 | 0.92 | 0.38 | 0.01 | 135,140 |
Roofers | 0.02 | 0.94 | 0.38 | 0.01 | 43,830 |
Roustabouts, Oil and Gas | 0.01 | 0.95 | 0.39 | 0.01 | 836,230 |
Maids and Housekeeping Cleaners | 0.02 | 0.94 | 0.34 | 0.01 | 43,080 |
Paving, Surfacing, and Tamping Equipment Op. | 0.00 | 0.96 | 0.29 | 0.01 | 43,080 |
Motorboat Operators | 0.01 | 0.93 | 0.39 | 0.01 | 2,710 |
Orderlies | 0.00 | 0.76 | 0.18 | 0.00 | 48,710 |
Floor Sanders and Finishers | 0.00 | 0.94 | 0.34 | 0.00 | 5,070 |
Pile Driver Operators | 0.00 | 0.98 | 0.24 | 0.00 | 3,010 |
Rail-Track Laying and Maintenance Equip. Op. | 0.00 | 0.96 | 0.27 | 0.00 | 18,770 |
Foundry Mold and Coremakers | 0.00 | 0.95 | 0.36 | 0.00 | 11,780 |
Water Treatment Plant and System Op. | 0.00 | 0.92 | 0.44 | 0.00 | 120,710 |
Bridge and Lock Tenders | 0.00 | 0.93 | 0.39 | 0.00 | 3,460 |
Dredge Operators | 0.00 | 0.99 | 0.22 | 0.00 | 940 |
Note: Metrics reported as mean of user goal and AI action score. |
Source 3 Working with AI:Measuring the Occupational Implications of Generative AI
Interestingly, the paper' results follow previous estimates of the effects of work from AI fairly closely, .g., from Eloundou et al. (Tyna Eloundou, Sam Manning, Pamela Mishkin, and Daniel Rock. GPTs are GPTs: Labor market impact potential of LLMs. Science, 384(6702):1306–1308, 2024.) and show very high correlation (0.73 on the occupation level and remarkable 0.91 on the most aggregated level of the broadest occupation group). This suggests that our intuition about the potential scope of the coverage of AI is about right on the average.
Common debate often frames the impact of AI in zero-sum modes: augmentation (AI works with humans to do things better) or automation (AI takes over tasks entirely). But the sources note that the capability of new tech has no direct impact on downstream business decisions. The venerable analogy of the ATM reveals the point just so: even as the ATM automated the very core function of the bank teller, jobs for bank tellers actually increased as branches proliferated and the tellers became relationship-builders.
This work substantiates the nuance. Even though action is taken by the AI, its core value today lies in amplification. Research and writing work, for instance, not only represent the most common user goals but also have the most positive user satisfaction and the most work completed. That means Copilot is actually proving useful in the above work, indicating that it is being used as an incredibly efficient productivity amplifier. It' more capable on the writing and research sides of knowledge work than on advanced data analysis or visualization design. "The activities that AI performs are very different from the user goals the AI assists: in 40% of conversations, these sets are disjoint." This alignment of user objectives and the activity of the AI works to further highlight this complement. In support of the user, the AI assists the user on more work than it accomplishes directly. The AI then moves into the mode of coaching, training, and advising and helps the user accomplish his or her knowledge work activity. For example, Copilot might help the person prepare meal by recommending recipes but actually doing the work of preparing the meal. This suggests an interactive rather than purely substitutable relation.
One of the most popular questions about the effect of AI on the labour market is about wages and schooling. Will the effect be greater on low-wage labour or on high-skilled jobs? Analysis found "weak and inconsistent relationship" between applicability score and average occupation wage for AI. Whereas prior work forecasted substantial impact on well-paying jobs, this usage data provides the inverse scenario and suggests the relationship is less direct than previously thought. Sales and office support jobs with high employment rates have relatively low wages but possess high applicability for AI. If we look at the schooling requirements, there' slightly more evident pattern: Bachelor' jobs have higher applicability score for AI than do jobs requiring less schooling. But the researchers note that there still exists massive overlap and extensive distribution of potential impact even in every wage and every education bucket. This suggests the narrative about the impact of AI only being driven in some parts of the workforce is incomplete.
What this suggests has important implications for adaptation and reskilling policies. If the impact of AI cannot be perfectly divided according to wage or schooling tiers, then the adaptation issue becomes more pervasive. This makes the argument for cross-cutting policies over group-oriented policies even more important.
Even as this work provides unprecedented insights, it also signals the vastness of the unknowns that lie ahead. The researchers candidly note the following limitations:
These limitations point out key questions still unanswered and signal key future research directions:
This is not only tech phenomenon; it' societal one. This work provides convincing data-driven proof that today' AI is embedded in much of today' knowledge work and communication. It' even helping us to gather information, write, and communicate better, almost entirely in the role of helper. The "human-in-the-loop" is far from disappearing, but the nature of the loop is essentially changing. Organisations must be tactically considerate in the ways they integrate AI so as to maximize augmentation and unlock the potential of the workforce rather than target and try to replace them. Policymakers face the near-term obligation of building lively reskilling programmes and safety nets that account for the widespread, not narrowly designed, impacts of AI.
For individuals, the take-away is straightforward:
Work' future is not predetermined point but landscape we ourselves are forging. If we understand the practical ground impacts rather than react to exaggerated reports, we collectively can steer this massive technology towards future that elevates human abilities and supports inclusive economic growth. The debate has begun; the difficult work of thoughtful adaptation remains.