Regarding the problem of reproducing Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning #7

Closed
liangchild opened this issue Mar 15, 2024 · 1 comment

Comments

@liangchild

Hello, I would like to inquire about an issue related to replicating the study "Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning". The paper mentions that both the text and video encoders utilize CLIP, yet in the code you provided, the base.yml configuration file specifies the text encoder as BERT and does not disclose what is used for the video encoder. Could you provide the configuration file for CLIP that you used?

@echojiang0830
Collaborator

Thank you for your interest in our work. We apologize that the released code cannot reproduce the paper's results; this is a consequence of company regulations.
Two points need clarifying:
First, to ensure a fair comparison with other methods (for example, by minimizing the influence of the code framework and runtime environment on the results), the results in our paper were obtained with an implementation based on the ts2net code framework (https://github.com/yuqi657/ts2_net).
Second, company regulations require that the code for our paper be open-sourced on top of the current Antmmf framework.
For these reasons, the existing scripts cannot currently fully reproduce the results in our paper. To reproduce them, the method needs to be migrated to the ts2net code framework.
