terrabyte

terrabyte is an innovative High Performance Data Analytics (HPDA) platform operated by DLR and the Leibniz Supercomputing Center (LRZ) of the Bavarian Academy of Sciences and Humanities. The platform offers scientists efficient access to Earth observation data, a high-performance processing environment and practical tools for data analysis. terrabyte provides a STAC API for efficient access to a selected number of earth observation products.

Getting Access to the terrabyte platform¤

Access is granted to scientist at DLR and LRZ for non commercial science applications. More detailed information can be found at the official terrabyte documentation.

The files are not downloaded since they are already on the local filesystem. Instead a sources.txt file is created in the output folder.

Output data¤

In contrast to USGS, CDSE and EOSDIS no files are downloaded. This is because all files are already on the server. Therefore the output folder will contain a text file calles sources.txt containing for each search result the path. The files are not copied to reduce the memory usage and unnecessary time spend on copying files.

Parameters¤

Name Type Description
out_dir str Output directory. Will contain the resulting sources.txt file.
collection str The collection to query. A list of all possible collections can be obtained by TerraByte.available_collections(). Some common collection are also available as enumerations in TerraByte.Collection.
shapefile_path str Path to a valid shapefile. Supported formats: .shp, .geojson, .kml, .gpkg. You can find shapefiles from sources like OSM Boundaries.
start_date str Start date for temporal filtering. Date string muste be in format yyyy-mm-dd.
end_date str End date for temporal filtering. Date string muste be in format yyyy-mm-dd.
Default is None which meants until the present.
naming_filter str Filter the result by name e.x. if the naming convention contains the product type, processing level or instrument. This is not a REGEX sequence.
- * matches any sequence of characters\br>(including none).
- ? matches a single character.
- [seq] matches any character in the sequence.
- [!seq] matches any character not in the sequence.
Default is None.

Example¤

Python
from sipt.retrieval import TerraByte

output_dir = "./temp/src"
shapefile_path = "path/to/your/shapefile.geojson"
start_date = '2023-07-01'
end_date = '2024-08-01'
dataset = TerraByte.Collection.LANDSAT_8_9_OT_C2_L2
naming_filter = None

# Start the retrieval
TerraByte.retrieve(output_dir,
                            collection,
                            shapefile_path,
                            start_date,
                            end_date,
                            naming_filter)