CMap: a database for mapping job titles, sector specialization, and promotions across 24 sectors

Abstract Understanding job titles, career trajectories, and promotions provides valuable insight into labor market dynamics and patterns of professional mobility. We introduce Career Map (CMap), a novel, large-scale dataset spanning 24 industry sectors, designed to support the study of job specializ...

Full description

Saved in:
Bibliographic Details
Main Authors: Shehryar Subhani, Shahan Ali Memon, Bedoor AlShebli
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-05526-3
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Understanding job titles, career trajectories, and promotions provides valuable insight into labor market dynamics and patterns of professional mobility. We introduce Career Map (CMap), a novel, large-scale dataset spanning 24 industry sectors, designed to support the study of job specialization, sectoral concentration, and career advancement. Using natural language processing techniques and large language models, we standardize 5.2 million job titles into 123 thousand unique titles and propose a Specialization Index to quantify how concentrated a given title is within a sector. The dataset includes both a structured job titles dataset and a set of identified promotions—32 thousand validated promotions from the United States and the United Kingdom, and 61 thousand inferred promotions from a global context. CMap enables research on job hierarchies, cross-sector mobility, and systemic inequalities in professional advancement. It provides a foundation for examining how education, experience, and institutional structures shape career outcomes across industries and regions, offering a valuable resource for economists, sociologists, and computational social scientists.
ISSN:2052-4463