-
Notifications
You must be signed in to change notification settings - Fork 214
Some questions about training my custom datasets #185
Comments
Hi @bjutlss , I have no answer to your question, but I'm trying to train the model on the LINEMOD dataset and I have no good results. How did you do to train your model and with which version of CUDA, python and PyTorch ? I'm using CUDA 11.6, python 3.9.16, pytorch 1.13.1 and I have this result : and these parameters in yolo-pose.cfg : Did you know which parameters to adjust to have a good result on the training ? Many thanks Aristide |
Well, My CUDA 11.1, python 3.8, pytorch 1.8.0 .
I just read the README to train the model without changing any parameters except batchsize.
But, The problem could be the wrong 3D model or the wrong diameter parameter.
…------------------ 原始邮件 ------------------
发件人: "microsoft/singleshotpose" ***@***.***>;
发送时间: 2023年3月17日(星期五) 晚上6:26
***@***.***>;
***@***.******@***.***>;
主题: Re: [microsoft/singleshotpose] Some questions about training my custom datasets (Issue #185)
Hi @bjutlss ,
I have no answer to your question, but I'm trying to train the model on the LINEMOD dataset and I have no good results.
How did you do to train your model and with which version of CUDA, python and PyTorch ?
I'm using CUDA 11.6, python 3.9.16, pytorch 1.13.1
and with this command :
and I have this result :
and these parameters in yolo-pose.cfg :
Did you know which parameters to adjust to have a good result on the training ?
Many thanks
Aristide
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
Thanks you for your quick answer ! |
If you can train normally, then it shouldn't be a problem with the CUDA, python version, the 3D models and rgb、labels don't correspond and that can happen.
…------------------ 原始邮件 ------------------
发件人: "microsoft/singleshotpose" ***@***.***>;
发送时间: 2023年3月17日(星期五) 晚上6:46
***@***.***>;
***@***.******@***.***>;
主题: Re: [microsoft/singleshotpose] Some questions about training my custom datasets (Issue #185)
Thanks you for your quick answer !
I will try with your version of pytorch and python.
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
So why I have no results on the training with the exact parameters, I change nothing and the validation works fine with there weights but not the training, I don't understand !! The prediction is in red and the ground truth in green |
You may have to find out for yourself, but that's all I can help you with. I just said what I thought!
…------------------ 原始邮件 ------------------
发件人: "microsoft/singleshotpose" ***@***.***>;
发送时间: 2023年3月17日(星期五) 晚上7:12
***@***.***>;
***@***.******@***.***>;
主题: Re: [microsoft/singleshotpose] Some questions about training my custom datasets (Issue #185)
So why I have no results on the training with the exact parameters, I change nothing and the validation works fine with there weights but not the training, I don't understand !!
Here what I have on all images, it seems to detect but not with the right dimensions :
![1AAABOX3D10](https://user-images.githubusercontent.com/46279890/225888992-85636c36-41a7-44e7-ba6c-03b05b330e6
e.png)
The prediction is in red and the ground truth in green
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
Many thanks for your response. |
------------------ 原始邮件 ------------------
发件人: "microsoft/singleshotpose" ***@***.***>;
发送时间: 2023年3月17日(星期五) 晚上7:23
***@***.***>;
***@***.******@***.***>;
主题: Re: [microsoft/singleshotpose] Some questions about training my custom datasets (Issue #185)
Many thanks for your response.
Just what do you have in your yolo-pose.cfg about weight and height please ?
thanks
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
Sorry I can't see your response correctly |
[net]
# io
batch=4
height=416
width=416
channels=3
num_keypoints=9
# training
momentum=0.9
decay=0.0005
angle=0
burn_in=1000
max_batches = 80200
policy=steps
max_epochs=500
learning_rate=0.001
steps=-1,80,160
scales=0.1,0.1,0.1
# test - eliminate low confidence predictions during testing
conf_thresh= 0.1
test_width=672
test_height=672
# data augmentation
saturation = 1.5
exposure = 1.5
hue=.1
[convolutional]
batch_normalize=1
filters=32
size=3
stride=1
pad=1
activation=leaky
[maxpool]
size=2
stride=2
[convolutional]
batch_normalize=1
filters=64
size=3
stride=1
pad=1
activation=leaky
[maxpool]
size=2
stride=2
[convolutional]
batch_normalize=1
filters=128
size=3
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=64
size=1
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=128
size=3
stride=1
pad=1
activation=leaky
[maxpool]
size=2
stride=2
[convolutional]
batch_normalize=1
filters=256
size=3
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=128
size=1
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=256
size=3
stride=1
pad=1
activation=leaky
[maxpool]
size=2
stride=2
[convolutional]
batch_normalize=1
filters=512
size=3
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=256
size=1
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=512
size=3
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=256
size=1
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=512
size=3
stride=1
pad=1
activation=leaky
[maxpool]
size=2
stride=2
[convolutional]
batch_normalize=1
filters=1024
size=3
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=512
size=1
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=1024
size=3
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=512
size=1
stride=1
pad=1
activation=leaky
[convolutional]
batch_normalize=1
filters=1024
size=3
stride=1
pad=1
activation=leaky
#######
[convolutional]
batch_normalize=1
size=3
stride=1
pad=1
filters=1024
activation=leaky
[convolutional]
batch_normalize=1
size=3
stride=1
pad=1
filters=1024
activation=leaky
[route]
layers=-9
[convolutional]
batch_normalize=1
size=1
stride=1
pad=1
filters=64
activation=leaky
[reorg]
stride=2
[route]
layers=-1,-4
[convolutional]
batch_normalize=1
size=3
stride=1
pad=1
filters=1024
activation=leaky
[convolutional]
size=1
stride=1
pad=1
# filters=125
filters=20
activation=linear
[region]
anchors =
bias_match=1
classes=1
coords=18
num=1
softmax=1
jitter=.3
rescore=1
object_scale=5
noobject_scale=0.1
class_scale=1
coord_scale=1
absolute=1
thresh = .6
random=1
…------------------ 原始邮件 ------------------
发件人: "microsoft/singleshotpose" ***@***.***>;
发送时间: 2023年3月17日(星期五) 晚上9:14
***@***.***>;
***@***.******@***.***>;
主题: Re: [microsoft/singleshotpose] Some questions about training my custom datasets (Issue #185)
Sorry I can't see your response correctly
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
Thank you @bjutlss I have the same ! |
I have the same problem, did you solve it? |
Not yet
…------------------ 原始邮件 ------------------
发件人: ***@***.***>;
发送时间: 2023年3月25日(星期六) 下午2:10
收件人: ***@***.***>;
抄送: ***@***.***>; ***@***.***>;
主题: Re: [microsoft/singleshotpose] Some questions about training my custom datasets (Issue #185)
***@***.***我有同样的 !
I have the same problem, did you solve it?
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
First of all, thank you for your work!
First,I consulted ObjectDatasetsTools and made my custom datasets with D435i.
1.I read this paper and learned that the regularization parameter needs to be set to 0 in pre-training. So am I right to train my data set directly with the following code?
python train.py --datacfg cfg/myobj.data --modelcfg cfg/yolo-pose.cfg --initweightfile cfg/darknet19_448.conv.23 --pretrain_num_epochs 15
or
python train.py --datacfg cfg/myobj.data --modelcfg cfg/yolo-pose.cfg --initweightfile backup/duck/init.weights
What's the difference between yolo-pose.cfg and yolo-pose-pre. cfg?
yolo -pose-pre. cfg is used for pre-training my target object?
2.After training, I had ran valid.py to evaluate the model and get the following results:
The Acc 3D transformation is significantly lower than the other two terms. Is this a normal result?
3.If the above steps are correct, I take a frame of the video stream and use it as RGB input to get the results of R_pr and t_pr, which coordinate system are they relative to the camera? (I used Realsense D435i).I'm going to view the resulting RT matrix as relative to the optical coordinate system. It was sent to the robot arm and got an incorrect grab.
@btekin @snsinha Can you answer these questions for me? Look forward to your quick answer!
The text was updated successfully, but these errors were encountered: