Result: Taxonomy OWLizer: A Tool for Converting Taxonomic Data into OWL

Title:
Taxonomy OWLizer: A Tool for Converting Taxonomic Data into OWL
Contributors:
Mathématiques, Informatique et STatistique pour l'Environnement et l'Agronomie (MISTEA), Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE)-Institut Agro Montpellier, Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro)-Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro), Universidade de São Paulo = University of São Paulo (USP), University of Twente, Embrapa Solos, Ministério da Agricultura
Source:
Biodiversity Information Science and Standards. 9:e176413-e176413
Publisher Information:
CCSD; Sofia : Pensoft Publishers, 2017, 2025.
Publication Year:
2025
Collection:
collection:INSTITUT-AGRO-MONTPELLIER
collection:INRAE
collection:INRAEOCCITANIEMONTPELLIER
collection:MISTEA
collection:MATHNUM
collection:RESEAU-EAU
collection:INSTITUT-AGRO
collection:TEST-MATHNUM
Original Identifier:
HAL: hal-05374839
Document Type:
Journal article<br />Journal articles
Language:
English
ISSN:
2535-0897
Relation:
info:eu-repo/semantics/altIdentifier/doi/10.3897/biss.9.176413
DOI:
10.3897/biss.9.176413
Rights:
info:eu-repo/semantics/OpenAccess
URL: http://creativecommons.org/licenses/by/
Accession Number:
edshal.hal.05374839v1
Database:
HAL

Further Information

The growing demand for high-quality, interoperable biodiversity data exposes the current challenges of standardizing taxonomic information across platforms. While the Global Biodiversity Information Facility Backbone Taxonomy (GBIF Secretariat 2023) offers a valuable reference for species names, translating this information into formal ontology structures for semantic integration remains a complex task. A key challenge lies in the dynamic nature of taxonomic knowledge: as GBIF-BT is continuously updated, ontologies built manually from its data risk becoming outdated or misaligned. Without automated mechanisms for synchronizing with these updates, such ontologies may fail to reflect current taxonomic consensus, limiting their interoperability and long-term usefulness in biodiversity informatics. To address this problem, we developed the Taxonomy OWLizer (TOWLizer), a lightweight web application that allows users to convert species names into Web Ontology Language (OWL) ontologies based on taxonomic data retrieved directly from the GBIF Species API, as detailed in Fig. 1. Users input one or multiple scientific names, and the application fetches, organizes, and outputs a structured OWL file, facilitating taxonomic integration into semantic web projects. OWLizer automates synonym handling (see Fig. 2) and taxonomic hierarchy generation, and reuses GBIF URIs in the code, following linked data principles (Berners-Lee 2006). It aims to lower the technical barriers for researchers working in biodiversity informatics who need machine-actionable taxonomies but may lack programming expertise. The application was implemented using HTML, JavaScript, Bootstrap, and a Flask backend, with ChatGPT-4 providing support for code drafting and debugging, which accelerated prototyping and streamlined development. The tool is freely available through a GitHub-hosted interface with the backend deployed on Render, and a backup archived in Zenodo (Soares 2025). TOWLizer has some limitations though. Its performance depends on free-tier hosting services: the backend may enter a sleep state after periods of inactivity, causing occasional delays, while caching is limited to local storage on a single device and browser. The algorithm is also sensitive to typographical errors in scientific names; a recommended workflow is to combine ChatGPT for spelling verification with TOWLizer for synonym management. Finally, GBIF URLs used in the application do not yet support content negotiation, which constrains their reusability in some semantic web contexts. We tested TOWLizer in the development of a real-world ontology, namely the Agricultural Product Types Ontology (APTO), which was designed to represent agricultural commodities in Brazil. In this context, we addressed the content negotiation issue by replacing the GBIF URLs with URIs from the APTO namespace. These URIs are served via AgroPortal (Jonquet et al. 2018), enabling proper content negotiation. Despite these constraints, the Taxonomy OWLizer has proven to be a practical and accessible way to generate OWL representations of taxonomic data. It provides an initial step toward automating taxonomic ontology construction, highlights the potential of AI-assisted development, and contributes to ongoing discussions in biodiversity informatics about sustainable, interoperable, and machine-actionable taxonomies. More details on the tool development are provided in Soares et al. 2025.