Synthetic Data Generation Using Large Language Models: Advances in Text and Code
This survey reviews how large language models (LLMs) are transforming synthetic training data generation in both natural language and code domains. By producing artificial but task-relevant examples, these models can significantly augment or even substitute for real-world datasets, particularly in s...
Saved in:
| Main Authors: | Mihai Nadas, Laura Diosan, Andreea Tomescu |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/11080380/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Synthetic Text as Data: On Usefulness and Limitations
by: Seoyeon Choi, et al.
Published: (2025-05-01) -
The Importance of AI Data Governance in Large Language Models
by: Saurabh Pahune, et al.
Published: (2025-05-01) -
A Study on Text Classification in the Age of Large Language Models
by: Paul Trust, et al.
Published: (2024-11-01) -
Leveraging Synthetic Data for Improved Manipuri-English Code-Switched ASR
by: Naorem Karline Singh, et al.
Published: (2025-01-01) -
The performance of the LSTM-based code generated by Large Language Models (LLMs) in forecasting time series data
by: Saroj Gopali, et al.
Published: (2024-12-01)