Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
..
3ff10309ac
Add new 2019 deadtree shapefile and description of processing steps prior to DVC tracking
2 years ago
ff6b1a01fb
Add computestats stage to calculate mean and std of subtiles for normalization
2 years ago
89215020b1
Add inference stats to stages
2 years ago
1c02e2846e
Make the model inference type selectable (#20)
3 years ago
8a19382226
Consolidate dataset creation into a single script (incl. negative and random samples, ds balancing)
2 years ago
2ab0af823f
Add negative samples to createmask stage and general script cleanup
2 years ago
c6f4048bc7
Allow the user to train in rgb, rgbn and single and multi class mode by changing hparams in conf
2 years ago

README.MD

You have to be logged in to leave a comment. Sign In

Preprocessing Pipeline 🪠

Stage 0 (not tracked)

Data origin

Data is sources from the The luxembourgish data platform.

Data is provided as JPEG2000 encoded 3-band images with 20cm resolution and projected in EPSG:2169 (LUREF).

Initial recombination and reprojection

Prior to the DVC-tracked preprocessing steps the raw data was reorganized as follows (the final output files are tracked in this repo though):

# year 2017 data
YEAR=2017
RGB=ortho2017_RVB_pays.jp2
NIR=LUX17_CIR_v3.jp2

# year 2019 data
#YEAR=2019
#RGB=ortho_2019.jp2
#NIR=ortho_2019ir.jp2

# merge bands (allow projection difference since one source has a differnt projection string but correct projection parameters)
gdalbuildvrt b1.vrt -b 1 ${RGB} 
gdalbuildvrt b2.vrt -b 2 ${RGB}
gdalbuildvrt b3.vrt -b 3 ${RGB}
gdalbuildvrt b4.vrt -b 1 ${NIR}
gdalbuildvrt -allow_projection_difference -separate allbands.vrt b1.vrt b2.vrt b3.vrt b4.vrt

gdal_translate -of GTiff -co "COMPRESS=LZW" -co "PREDICTOR=2" -co "TILED=YES" -co "BIGTIFF=YES" -co "ALPHA=NO" -colorinterp_1 "red" -colorinterp_2 "green" -colorinterp_3 "blue" allbands.vrt ortho_ms_${YEAR}.tif

# reproject and limit extent to common area
gdalwarp -co "NUM_THREADS=ALL_CPUS" -multi  -overwrite -ot Byte -ts 302346 420442 -te 263584.672 5479988.018 324060.605 5564085.859 -t_srs '+proj=utm +zone=32 +ellps=GRS80 +towgs84=0,0,0,0,0,0,0 +units=m +no_defs' -r average ortho_ms_2017.tif ortho_ms_${YEAR}_EPSG3044.vrt
gdal_translate -co "NUM_THREADS=ALL_CPUS" -co "COMPRESS=LZW" -co "PREDICTOR=2" -co "TILED=YES" -co "BIGTIFF=YES" -co "ALPHA=NO" ortho_ms_${YEAR}_EPSG3044.vrt ortho_ms_${YEAR}_EPSG3044.tif

Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...