@mariamabarham
Contributor

Add the DoQA dataset (ACL 2020): http://ixa.eus/node/12931

Member

@lhoestq left a comment

Cool! :D

I left a few comments. Feel free to also run `make style`.

BUILDER_CONFIGS = [
    DoqaConfig(
        name="cooking",
        description=""
Member

Since the description is empty, you can simply remove it.

name=nlp.Split.TEST,
gen_kwargs={"filepath": os.path.join(path, "doqa-v2.1", "doqa_dataset", "doqa-travel-test-v2.1.json")},
)
]
Member

Maybe raise an error if the config name is wrong?
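
One way to act on this suggestion is sketched below. It is only an illustration, not the code from this PR: `KNOWN_CONFIGS` and `check_config_name` are hypothetical names, and the list of valid configs is an assumption (only "cooking" appears in the diff; "travel" is inferred from the test file name).

```python
# Hypothetical list of valid config names. Only "cooking" is visible in the
# diff above; "travel" is inferred from the test file name and is an assumption.
KNOWN_CONFIGS = ("cooking", "travel")


def check_config_name(name):
    """Return the name unchanged if it is known, otherwise raise ValueError."""
    if name not in KNOWN_CONFIGS:
        raise ValueError(
            f"Unknown config name {name!r}; expected one of: "
            + ", ".join(KNOWN_CONFIGS)
        )
    return name
```

Calling such a check at the top of the split-generation code, before any file path is built, would make a mistyped config name fail with a clear message rather than silently producing no splits.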

Comment on lines +162 to +163
orig_answer_start = [qa["orig_answer"]["answer_start"]]
orig_answer_text = [qa["orig_answer"]["text"]]
Member

According to the features description, those two fields are supposed to be a Sequence, but it looks like they always contain exactly one element. Is this intended?
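
To make the question concrete, here is a small plain-Python sketch of the two shapes being contrasted. The helper names and the sample record are made up for illustration and do not come from the PR; the sketch only mirrors the key names shown in the diff.

```python
def answer_as_sequence(qa):
    """Shape used in the diff: each field is a list that always has one element."""
    return {
        "answer_start": [qa["orig_answer"]["answer_start"]],
        "text": [qa["orig_answer"]["text"]],
    }


def answer_as_scalar(qa):
    """Alternative shape: plain values, if there is never more than one answer."""
    return {
        "answer_start": qa["orig_answer"]["answer_start"],
        "text": qa["orig_answer"]["text"],
    }


# Made-up sample record, for illustration only.
sample_qa = {"orig_answer": {"answer_start": 12, "text": "about ten minutes"}}

as_sequence = answer_as_sequence(sample_qa)
as_scalar = answer_as_scalar(sample_qa)
```

If the original answer is always unique, the scalar shape describes the data more directly; the sequence shape would only be needed if several original answers could occur, which is what the question above is asking.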

@jplu merged commit 10b6e9d into master on Sep 3, 2020
@jplu deleted the doqa_dataset branch on September 3, 2020, 11:44