terrabyte
terrabyte is a High Performance Data Analytics (HPDA) platform operated by DLR and the Leibniz Supercomputing Center (LRZ) of the Bavarian Academy of Sciences and Humanities. The platform offers scientists efficient access to Earth observation data, a high-performance processing environment and practical tools for data analysis. terrabyte provides a STAC API for efficient access to a selection of Earth observation products.
Getting Access to the terrabyte platform
Access is granted to scientists at DLR and LRZ for non-commercial scientific applications. More detailed information can be found in the official terrabyte documentation.
The files are not downloaded because they are already on the local filesystem. Instead, a sources.txt file is created in the output folder.
Output data
In contrast to USGS, CDSE and EOSDIS, no files are downloaded because all files are already on the server. Instead, the output folder will contain a text file called sources.txt that lists the path of each search result. The files are not copied, which saves storage space and avoids the time spent on copying them.
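Downstream code can read sources.txt directly. The following is a minimal sketch that assumes the file holds one path per line; the exact layout of the file is not specified here, and read_sources is a hypothetical helper, not part of sipt.

```python
from pathlib import Path


def read_sources(out_dir):
    """Return the paths listed in <out_dir>/sources.txt.

    Assumption: the file contains one path per line.
    Blank lines are skipped.
    """
    sources_file = Path(out_dir) / "sources.txt"
    with sources_file.open(encoding="utf-8") as fh:
        return [Path(line.strip()) for line in fh if line.strip()]
```

Calling read_sources("./temp/src") after a retrieval would then give a list of pathlib.Path objects that can be opened in place, without copying the data.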
Parameters
| Name | Type | Description |
|---|---|---|
| out_dir | str | Output directory. Will contain the resulting sources.txt file. |
| collection | str | The collection to query. A list of all possible collections can be obtained with TerraByte.available_collections(). Some common collections are also available as enumerations in TerraByte.Collection. |
| shapefile_path | str | Path to a valid shapefile. Supported formats: .shp, .geojson, .kml, .gpkg. You can find shapefiles from sources like OSM Boundaries. |
| start_date | str | Start date for temporal filtering. The date string must be in the format yyyy-mm-dd. |
| end_date | str | End date for temporal filtering. The date string must be in the format yyyy-mm-dd. Default is None, which means until the present. |
| naming_filter | str | Filter the results by name, e.g. if the naming convention contains the product type, processing level or instrument. This is not a regular expression.<br>- * matches any sequence of characters (including none).<br>- ? matches a single character.<br>- [seq] matches any character in the sequence.<br>- [!seq] matches any character not in the sequence.<br>Default is None. |
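The wildcard rules listed for naming_filter are the same as those of Python's standard fnmatch module. Whether TerraByte uses fnmatch internally is an assumption; the sketch below only illustrates how such patterns behave. The product names are made up for illustration, and filter_names is a hypothetical helper.

```python
from fnmatch import fnmatchcase

# Hypothetical product names, for illustration only.
names = [
    "LC08_L2SP_193026_20230705",
    "LC09_L2SP_193026_20230713",
    "LE07_L1TP_193026_20230709",
]


def filter_names(names, pattern):
    """Keep the names that match a shell-style wildcard pattern."""
    return [name for name in names if fnmatchcase(name, pattern)]


# [89] matches either digit, * matches the rest of the name.
landsat_8_9 = filter_names(names, "LC0[89]_*")
# * on both sides matches names containing "_L2SP_" anywhere.
level_2 = filter_names(names, "*_L2SP_*")
# ? matches one character, [!8] matches any single character except "8".
not_landsat_8 = filter_names(names, "L?0[!8]_*")
```

Note that a pattern has to match the whole name, so a filter for a substring needs a leading and a trailing *.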
Example
from sipt.retrieval import TerraByte

output_dir = "./temp/src"
shapefile_path = "path/to/your/shapefile.geojson"
start_date = "2023-07-01"
end_date = "2024-08-01"
# Collection to query, taken from the predefined enumeration
collection = TerraByte.Collection.LANDSAT_8_9_OT_C2_L2
naming_filter = None  # no filtering by name

# Start the retrieval; the result paths are written to sources.txt in output_dir
TerraByte.retrieve(
    output_dir,
    collection,
    shapefile_path,
    start_date,
    end_date,
    naming_filter,
)
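Because the shapefile path and the dates are passed as plain strings, they can be checked before a retrieval is started. The sketch below is a hypothetical pre-flight helper: check_arguments is not part of sipt, and treating file extensions case-insensitively is an assumption. It only encodes the formats documented in the parameter table.

```python
from datetime import datetime
from pathlib import Path

# Extensions listed as supported in the parameter table.
SUPPORTED_EXTENSIONS = {".shp", ".geojson", ".kml", ".gpkg"}


def check_arguments(shapefile_path, start_date, end_date=None):
    """Return (start, end) as date objects, or raise ValueError.

    end is None when end_date is None (meaning: until the present).
    """
    # Assumption: extensions are compared case-insensitively.
    suffix = Path(shapefile_path).suffix.lower()
    if suffix not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"unsupported shapefile format: {suffix!r}")

    # strptime raises ValueError if the string cannot be parsed as yyyy-mm-dd.
    start = datetime.strptime(start_date, "%Y-%m-%d").date()
    end = None
    if end_date is not None:
        end = datetime.strptime(end_date, "%Y-%m-%d").date()
        if end < start:
            raise ValueError("end_date is earlier than start_date")
    return start, end
```

For the values used in the example, check_arguments(shapefile_path, start_date, end_date) returns the two parsed dates, while a path ending in .txt or a date such as "01.07.2023" raises ValueError.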