OpenAI's GPT-4 and Anthropic's Claude 2 were the most capable large language models on the GSM8k benchmark in 2023. The rapid increase in AI capabilities is evident in the scores of GPT-3.5 and GPT-4: the later model scored 35 percent higher than its predecessor.
GSM8k benchmark comparison between major generative artificial intelligence (AI) programs in 2023
xAI. (November 4, 2023). GSM8k benchmark comparison between major generative artificial intelligence (AI) programs in 2023 [Graph]. In Statista. Retrieved November 10, 2024, from https://www-statista-com.ezproxy.canberra.edu.au/statistics/1447745/gsm8k-benchmark-comparison-of-major-ai-programs/