keras-yolo3

Introduction

This project is a fork of qqwweee/keras-yolo3.

Warning: This fork has not been exhaustively tested and bugs are expected.

Quick Start

Download the YOLOv3 weights from the YOLO website.

wget https://pjreddie.com/media/files/yolov3.weights

Convert the Darknet YOLO model to a Keras model.
Prepare your project config file in YAML format (.yml):

train_path: my_train_set.txt
test_path: my_test_set.txt
classes_path: model_data/my_classes.txt
anchors_path: model_data/my_anchors.txt
model_name: any_name_for_my_model
log_dir: logs/my-test/
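
Before starting a run, it can help to check that the config file defines every key shown above. The sketch below is a hypothetical helper and is not part of this repository. It assumes a flat file of "key: value" lines and treats all six keys from the example as required, which the scripts themselves may not do. A small hand-written parser keeps it dependency-free; a real project would more likely use a YAML library.

```python
# Hypothetical config check; the key list is taken from the example above.
REQUIRED_KEYS = ("train_path", "test_path", "classes_path",
                 "anchors_path", "model_name", "log_dir")


def read_flat_config(text):
    """Parse flat 'key: value' lines into a dict (no nesting, no lists)."""
    config = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        key, sep, value = line.partition(":")
        if not sep:
            raise ValueError("expected 'key: value', got: {!r}".format(line))
        config[key.strip()] = value.strip()
    return config


def missing_keys(config):
    """Return the required keys that are absent or empty, in a fixed order."""
    return [key for key in REQUIRED_KEYS if not config.get(key)]


example = """
train_path: my_train_set.txt
test_path: my_test_set.txt
classes_path: model_data/my_classes.txt
anchors_path: model_data/my_anchors.txt
model_name: any_name_for_my_model
log_dir: logs/my-test/
"""

config = read_flat_config(example)
print(missing_keys(config))
```

An empty list means every expected key is present.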

Training

python3 train.py --config_path myprojects/test1-config.yml -m 5000

Note: For training, the -m parameter sets a limit on GPU memory, in MB.

Inference

python3 yolo.py --config_path myprojects/test1-config.yml --weights logs/seg-000/ep004-loss-106.913-val_loss-114.463.h5

Note: For inference, memory usage is limited to 30%.

Multi-GPU usage is optional. Change the number of GPUs and add the GPU device IDs.

Training

Generate your own annotation file and class names file.
One row for one image;
Row format: image_file_path box1 box2 ... boxN;
Box format: x_min,y_min,x_max,y_max,class_id (no space).
For the VOC dataset, try python voc_annotation.py
Here is an example:
```
path/to/img1.jpg 50,100,150,200,0 30,50,200,120,3
path/to/img2.jpg 120,300,250,600,2
...
```
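
The row format above can be read back with a few lines of Python. This is a minimal sketch for checking an annotation file, not code from this repository. The function and type names are hypothetical, and it assumes integer coordinates and image paths without spaces, as in the example.

```python
from collections import namedtuple

# Hypothetical container for one box; field order follows the box format above.
Box = namedtuple("Box", "x_min y_min x_max y_max class_id")


def parse_annotation_row(row):
    """Split one annotation row into (image_path, [Box, ...])."""
    parts = row.split()
    if not parts:
        raise ValueError("empty annotation row")
    image_path = parts[0]
    boxes = []
    for field in parts[1:]:
        values = field.split(",")
        if len(values) != 5:
            raise ValueError("expected 5 comma-separated values, got: {!r}".format(field))
        boxes.append(Box(*(int(v) for v in values)))
    return image_path, boxes


path, boxes = parse_annotation_row("path/to/img1.jpg 50,100,150,200,0 30,50,200,120,3")
print(path)
for box in boxes:
    print(box)
```

Running it over every line of a training file before training is a cheap way to catch rows with a stray space inside a box.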

1.1 To generate the train.txt file from a pre-configured folder used in Darknet training, use the darknet_annotation.py script and change its WIDTH and HEIGHT parameters.
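
Darknet label files store each box as class_id x_center y_center width height, with coordinates normalized to the image size. That is presumably why the script needs WIDTH and HEIGHT. The sketch below shows the conversion to the box format used here. It is a hypothetical function, not the code in darknet_annotation.py, and rounding to the nearest pixel is an assumption.

```python
def darknet_to_box(label_line, image_width, image_height):
    """Convert one Darknet label line to 'x_min,y_min,x_max,y_max,class_id'."""
    class_id, x_center, y_center, width, height = label_line.split()
    # Scale the normalized values to pixels.
    x_center = float(x_center) * image_width
    y_center = float(y_center) * image_height
    width = float(width) * image_width
    height = float(height) * image_height
    # Move from center/size to corner coordinates.
    x_min = int(round(x_center - width / 2))
    y_min = int(round(y_center - height / 2))
    x_max = int(round(x_center + width / 2))
    y_max = int(round(y_center + height / 2))
    return "{},{},{},{},{}".format(x_min, y_min, x_max, y_max, int(class_id))


# A box centered in a 640x480 image, a quarter as wide and half as tall.
print(darknet_to_box("0 0.5 0.5 0.25 0.5", 640, 480))
```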

Make sure you have run python convert.py -w yolov3.cfg yolov3.weights model_data/yolo_weights.h5
The file model_data/yolo_weights.h5 is used to load pretrained weights.
Modify train.py and start training.
python train.py
Use your trained weights or checkpoint weights in yolo.py.
Remember to modify the class path or anchor path.

If you want to use the original pretrained weights for YOLOv3:
1. wget https://pjreddie.com/media/files/darknet53.conv.74
2. Rename it to darknet53.weights
3. python convert.py -w darknet53.cfg darknet53.weights model_data/darknet53_weights.h5
4. Use model_data/darknet53_weights.h5 in train.py

Prediction

The yolo.py script has been modified and now writes an inference_output_<version>.txt file every time it is run. The <version> is the current datetime, which gives a simple inference history.
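
A datetime-stamped name of this kind can be built as sketched below. The function name is hypothetical and the timestamp layout (year, month, day, hour, minute, second with no separators) is an assumption. The format that yolo.py actually uses may differ.

```python
from datetime import datetime


def inference_output_name(now=None):
    """Build an output file name stamped with the given (or current) datetime."""
    if now is None:
        now = datetime.now()
    # Assumed layout: YYYYmmddHHMMSS, so names sort chronologically.
    return "inference_output_{}.txt".format(now.strftime("%Y%m%d%H%M%S"))


print(inference_output_name())
```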

Prediction output format:
One row for one image;
Row format: image_file_path prediction1 prediction2 ... predictionN;
Prediction format: x_min,y_min,x_max,y_max,class_id,confidence_score (no space).
Here is an example:
```
path/to/img1.jpg 50,100,150,200,0,0.9876 30,50,200,120,3,0.3211
path/to/img2.jpg
path/to/img3.jpg 120,300,250,600,2,0.8319
...
```
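
The output file can be post-processed with a small parser, for example to keep only confident detections. This is a minimal sketch, not code from this repository. The names are hypothetical, and it assumes integer coordinates and image paths without spaces, as in the example. A row with only a path, such as img2 above, yields an empty list.

```python
from collections import namedtuple

# Hypothetical container; field order follows the prediction format above.
Prediction = namedtuple("Prediction", "x_min y_min x_max y_max class_id score")


def parse_prediction_row(row):
    """Split one output row into (image_path, [Prediction, ...])."""
    parts = row.split()
    if not parts:
        raise ValueError("empty prediction row")
    image_path = parts[0]
    predictions = []
    for field in parts[1:]:
        values = field.split(",")
        if len(values) != 6:
            raise ValueError("expected 6 comma-separated values, got: {!r}".format(field))
        numbers = [int(v) for v in values[:5]] + [float(values[5])]
        predictions.append(Prediction(*numbers))
    return image_path, predictions


def keep_confident(predictions, min_score):
    """Return the predictions whose confidence score is at least min_score."""
    return [p for p in predictions if p.score >= min_score]


rows = [
    "path/to/img1.jpg 50,100,150,200,0,0.9876 30,50,200,120,3,0.3211",
    "path/to/img2.jpg",
    "path/to/img3.jpg 120,300,250,600,2,0.8319",
]
for row in rows:
    image_path, predictions = parse_prediction_row(row)
    print(image_path, keep_confident(predictions, 0.5))
```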

Some issues to know

The test environment is:
- Python 3.5.2
- Keras 2.1.5
- TensorFlow 1.6.0
Default anchors are used. If you use your own anchors, some changes are probably needed.
The inference result is not exactly the same as Darknet's, but the difference is small.
The speed is slower than Darknet. Replacing PIL with OpenCV may help a little.
Always load pretrained weights and freeze layers in the first stage of training, or try Darknet training. A mismatch warning is OK.
The training strategy is for reference only. Adjust it according to your dataset and your goal, and add further strategies if needed.
To speed up training with frozen layers, train_bottleneck.py can be used. It computes the bottleneck features of the frozen model first and then trains only the last layers, which makes training on a CPU possible in a reasonable time. See this for more information on bottleneck features.



gustavovaliati/keras-yolo3
