Is one run enough? Reproducibility of flagship large language models across temperature and reasoning settings in biomedical text processing
20260 citationsJournal Articlehybrid Open Access
Field-Weighted Citation Impact: 0.00
Is one run enough? Reproducibility of flagship large language models across temperature and reasoning settings in biomedical text processing | Researchclopedia