I am planning to reproduce some of your baseline results following #22, but I ran into problems when reproducing the extractive oracle for BOOKSUM-chapter and got a slightly different result from your paper: I got ROUGE-1/2/L (F1) of 42.38/9.82/20.62, while 42.68/9.66/21.33 is reported in your paper.
Here are my steps:
1. Split the text in BOOKSUM-paragraph (lines in `chapter_summary_aligned_{}_split.jsonl.gathered.stable`) into sentences with spaCy, and compute the oracle for each instance as described in Section 4.2 of your paper.
2. Split the text in BOOKSUM-chapter (lines in `chapter_summary_aligned_{}_split.jsonl.gathered`) into paragraphs with `merge_text_paragraphs()` from `align_data_bi_encoder_paraphrase.py`, then split each paragraph into sentences as in Step 1.
3. Map ALL of the oracle sentences obtained in Step 1 to the chapter sentences of BOOKSUM-chapter obtained in Step 2.

Now I have BOOKSUM-chapter with the text split into sentences, each sentence marked as oracle or not, and I can compute ROUGE for each chapter instance.
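For reference, the greedy oracle selection in Step 1 can be sketched roughly as below. This is a minimal from-scratch reimplementation for illustration only: the function names, the ROUGE-1-only criterion, and the stopping rule are my own assumptions, not necessarily what your paper uses (in practice I split sentences with spaCy and score with a full ROUGE package):

```python
# Minimal sketch of greedy extractive-oracle selection.
# NOTE: hypothetical reimplementation; the paper's actual oracle may use a
# different ROUGE variant, stemming, or sentence budget.
from collections import Counter


def rouge1_f1(candidate_tokens, reference_tokens):
    """ROUGE-1 F1 computed from scratch (no stemming, for illustration)."""
    if not candidate_tokens or not reference_tokens:
        return 0.0
    # Unigram overlap via multiset intersection.
    overlap = sum((Counter(candidate_tokens) & Counter(reference_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(candidate_tokens)
    recall = overlap / len(reference_tokens)
    return 2 * precision * recall / (precision + recall)


def greedy_oracle(sentences, summary, max_sents=5):
    """Greedily add source sentences while ROUGE-1 F1 vs. the summary improves."""
    ref = summary.lower().split()
    selected, best = [], 0.0
    while len(selected) < max_sents:
        gains = []
        for i, sent in enumerate(sentences):
            if i in selected:
                continue
            candidate = " ".join(sentences[j] for j in sorted(selected + [i]))
            gains.append((rouge1_f1(candidate.lower().split(), ref), i))
        if not gains:
            break
        score, idx = max(gains)
        if score <= best:
            break  # no remaining sentence improves the score
        best = score
        selected.append(idx)
    return sorted(selected)
```

If this differs from your selection procedure (e.g., recall-based scoring, a combined ROUGE-1/2 criterion, or a fixed sentence budget), that could explain part of the score gap.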
Is anything wrong in my steps? Could you give more details about how you performed this?
Another question: the extractive models do not seem to be directly available on Hugging Face and require additional effort to reproduce. Did you train and evaluate models such as BertExt and MatchSum using the code from their original repos? Could you give some instructions about this as well?
Thank you very much! @jigsaw2212 @muggin