Synthetic Data Generation Using Large Language Models: Advances in Text and Code

This survey reviews how large language models (LLMs) are transforming synthetic training data generation in both natural language and code domains. By producing artificial but task-relevant examples, these models can significantly augment or even substitute for real-world datasets, particularly in s...

Full description

Saved in:
Bibliographic Details
Main Authors: Mihai Nadas, Laura Diosan, Andreea Tomescu
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11080380/
Tags: Add Tag
No Tags, Be the first to tag this record!