Large Language Models (LLMs) have shown remarkable capabilities in question answering across various domains, yet their effectiveness in ecological knowledge remains underexplored. Understanding their potential to recall and synthesize ecological information is crucial as AI tools become increasingly integrated into scientific workflows. Here, we assess the ecological knowledge of two LLMs, Gemini 1.5 Pro and GPT-4o, across a suite of ecologically focused tasks. These tasks evaluate an LLM's ability to predict species presence, generate range maps, list critically endangered species, classify threats, and estimate species traits. We introduce a new benchmark dataset to quantify LLM performance against expert-derived data. While the LLMs tested outperform naive baselines, achieving around 20 percentage points higher accuracy in species presence prediction, they reach only a third of the mean F1 score for range map generation and improve threat classification by only around 10 percentage points over random guessing. These results highlight both the promise and the challenges of applying LLMs in ecology. Our findings suggest that domain-specific fine-tuning is necessary to improve ecological knowledge in LLMs. By providing a repeatable evaluation framework, our benchmark dataset will facilitate future research in this area, helping to refine AI applications for ecological science.

• We introduce a benchmark to assess ecological knowledge in LLMs.
• Two LLMs, Gemini 1.5 Pro and GPT-4o, are evaluated on five ecological tasks.
• LLMs estimate species presence well but struggle with threats and range mapping.
• Further domain-specific tuning is needed to improve ecological performance in LLMs.
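To make the range-mapping metric concrete, the sketch below shows one plausible way an F1 score could be computed for a predicted binary range map against an expert-derived map. The grids and the `f1_score` helper are illustrative assumptions, not code or data from the benchmark itself.

```python
# Hypothetical sketch: scoring a predicted species range map against an
# expert-derived map with the F1 score, as in the range-mapping task.
# The toy grids below are invented for illustration.

def f1_score(pred, truth):
    """F1 over binary presence grids (1 = species present in that cell)."""
    tp = sum(1 for p, t in zip(pred, truth) if p and t)
    fp = sum(1 for p, t in zip(pred, truth) if p and not t)
    fn = sum(1 for p, t in zip(pred, truth) if not p and t)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Flattened 4-cell toy grids: LLM-predicted presence vs. expert range.
predicted = [1, 1, 0, 0]
expert    = [1, 0, 1, 0]
print(round(f1_score(predicted, expert), 2))  # tp=1, fp=1, fn=1 -> 0.5
```

Averaging such per-species F1 scores over all evaluated species would yield the mean F1 reported for range map generation.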