Pipeline Description¶
Visual Overview¶
graph TD
A["Lot Preparation
*(chimcla split-into-lots)*"]
A -->|separated lots|B
B["Form Image Preprocessing
*(chimcla create-
production-images)*"]
B --> C[Bar-Separation]
C --> D["Brightness evaluation
(S-Value calculation)"]
D -->F["Step History
Evaluation
*(chimcla_step
_history_eval)*"]
D -->G["Generation of
Experimental Images
*(chimcla_ced)*"]
D --> E["Generation of
CNN Training Data
*(chimcla_create
_work_images)*"]
click A "#step-0" "go to respective section" _self
click B "#step-1" "go to respective section" _self
click C "#step-2" "go to respective section" _self
click D "#step-3" "go to respective section" _self
Expected Directory Structure¶
data_images
├── __chimcla_data__.txt → the presence of this file indicates that its parent directory
│ is the root of all relevant chimcla-data
├── jpg1000 → rescaled work images (jpg), result of manual preprocessing
├── raw → original images in png format (before saving every image)
├── raw_jpg → original images in jpg format (necessary to reduce transfer load)
│ ├──
│ └──
│
├── pp_result → host to the output dirs of automated preprocessing
│ ├── ...
│ └── <lotdir>
│ ├── part000
│ │ └── shading_corrected
│ │ ↳ main output of preprocessing
│ └── ...
│
└── png_paths.txt → file created by preparation command
Step 0: Lot Preparation¶
Motivation and Background¶
There are many (>>100K) raw images. If they are located in a single directory typical file browsers are very slow or even crash. To simplify their handling they are split into “lots”. Each lot corresponds to one cycle of production without longer interruption (e.g. 3 days). Each lot is subdivided into parts of (ca.) 1000 raw images.
How To Do It¶
preparation: create a list of paths:
manually move all images from
$HOME/mnt/XAI-DIA-gl/Sascha/Images_from_Peineto$HOME/mnt/XAI-DIA-gl/Carsten/data_images/raw_jpg(speed: 10K/min)workdir:
$HOME/mnt/XAI-DIA-gl/Carsten/data_images/raw_jpgcommand:
find . -type f -name '*.jpg' > jpg_paths.txt(takes approx. 40s for 200K image)
usage:
chimcla split-into-lots ~/mnt/XAI-DIA-gl/Carsten/data_images/raw_jpg/jpg_paths.txt(takes 20m for 200K images)manual post processing
move subdirectories of
data_images/raw_jpg/lots/intodata_images/lots(make sure that nothing is overwritten)