Comparative analysis of leading artificial intelligence chatbots in the context of entrepreneurship

Abstract Artificial intelligence (AI) chatbots show remarkable abilities across applications. Despite a growing literature, their capability in the field of entrepreneurship is not fully understood. The aim of this study is to empirically evaluate and compare capabilities of five major AI chatbots—G...

Full description

Saved in:
Bibliographic Details
Main Authors: Firuz Kamalov, David Santandreu Calonge, Patrik T. Hultberg, Linda Smail, Dima Jamali
Format: Article
Language:English
Published: SpringerOpen 2025-06-01
Series:Journal of Innovation and Entrepreneurship
Subjects:
Online Access:https://doi.org/10.1186/s13731-025-00527-3
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Artificial intelligence (AI) chatbots show remarkable abilities across applications. Despite a growing literature, their capability in the field of entrepreneurship is not fully understood. The aim of this study is to empirically evaluate and compare capabilities of five major AI chatbots—GPT-3.5, GPT-4, Gemini 1.0, Llama 2, and Claude—in the context of entrepreneurship theory, using a benchmark entrepreneurship test. In particular, the performance of the chatbots on a set of multiple-choice questions, short-answer questions, and essay questions related to entrepreneurship is assessed. The results indicate that GPT-4 delivers the strongest overall performance. Meanwhile, Llama 2 offers precise responses with a significantly lower word count compared to the GPT models. Although chatbots do not always provide correct or precise answers to questions or complex prompts, they still prove to be valuable analytical tools for entrepreneurs. While the study offers compelling insights into chatbots’ grasp of entrepreneurship concepts, the findings are somewhat limited by the scarce availability of data.
ISSN:2192-5372