Treffer: Dataset: Application of Remote Sensing and Machine Learning Algorithms for Shipwreck Susceptibility Mapping in China

Title:
Dataset: Application of Remote Sensing and Machine Learning Algorithms for Shipwreck Susceptibility Mapping in China
Authors:
Publisher Information:
Zenodo
Publication Year:
2025
Collection:
Zenodo
Document Type:
dataset
Language:
unknown
DOI:
10.5281/zenodo.17204584
Rights:
Creative Commons Attribution 4.0 International ; cc-by-4.0 ; https://creativecommons.org/licenses/by/4.0/legalcode
Accession Number:
edsbas.9BAE0B7C
Database:
BASE

Weitere Informationen

README This repository contains the computational code and data that accompanies the manuscript: Title: Application of Remote Sensing and Machine Learning Algorithms for Shipwreck Susceptibility Mapping in China Authors: Junhui Chen, Fei Tang, Heshan Lin, Yong Chen, Yuyue Chen, Peiru,Lin, Bo Huang, and Xueping Lin Repository Contents This repository contains the computational code and data used to generate the results presented in the manuscript. Files: ANNPrediction0521.py, RFPrediction0521.py, SVMprediction0521.py: These are the primary Python scripts. When run, they will generate three corresponding output folders: ANN0521, RF0521, and SVM0521. Fig8.py: This script generates Figure 8 of the manuscript using the output from the three main prediction scripts (ANNPrediction0521.py, RFPrediction0521.py, and SVMprediction0521.py). training2025new_FR.gdb, trainingset.txt, validatingset.txt: These are the input data files required by the scripts. Name_Chinese_and_English.xlsx: This spreadsheet provides the Chinese and English names for the 16 conditioning factors mentioned in the article, as the code uses the Chinese names while the article uses the English ones. requirements.txt: This file lists the exact versions of the Python packages required to run the code. Software and Dependencies The code was developed using PyCharm Community Edition 2023.2 with Python 3.11.0. To run the scripts, the following key Python packages are required: rasterio: For reading, writing, and manipulating geospatial raster data. pandas: For data manipulation and analysis, particularly for handling the input .txt files. matplotlib: For plotting and saving figures, such as the ROC curves. scikit-learn: For machine learning tasks, including model training (MLPClassifier), evaluation, and hyperparameter tuning (GridSearchCV). joblib: For saving and loading the trained machine learning model. numpy: For numerical operations, especially for handling arrays and geospatial data. A requirements.txt file is included in the repository, listing the ...