Skip to content

Summary of potential issues (20220608) #253

@pfliu-nlp

Description

@pfliu-nlp

1. cmrc2019

  • dataset = load_dataset("cmrc2019")
  • task_type: cloze-multiple-choice
    image
  • example: load_dataset("gaokao2018_np1", "cloze-multiple-choice")

2. dureader_yesno

  • should answer be answers?
  • we should introduce context as a column
  • this is not a qa_extractive? it should be qa_multiple_choice or qa_bool?
@register_task(TaskType.qa_bool_dureader)
@dataclass
class QuestionAnsweringBoolDureader(QuestionAnswering):
    task: TaskType = TaskType.qa_bool_dureader
    question_column: str = "question"
    context_column: str = "documents"
    answers_column: str = "answers"
    
    answers: {"text": "xxx", "yesno_answer":"Yes"}

3. dureader_search

  • the task is qa_extractive while the context_column = "documents" is not a string

4. ckbqa

  • this dataset could be broken down to two tasks
    • qa_open_domain: question_column, answers_column
    • text_to_sql: question_column, sql_column

5. coqa

  • Similar to the above one

6. dureader_robust

answers = {"text": answer_text, "answer_start": answer_start}
(1) answers = [{"text": answer_text, "answer_start": answer_start}]
(2) answers = {"text": [answer_text], "answer_start": [answer_start]}

7. ccpm

8. cail2019

  • Similar to the above one

ccks2019_fin

  • the event type should also be regarded as one input?

ccks2020_fin_ee

ccks2021_fin_ea

  • it seems that the schema of arguments is different from the above one, so we probably need to modify the task name a little bit
  • v.s 2020: define a new task schema for ccks2021_fin_ea?

ccks2021_fin_re

  • it seems that the schema of relation is pretty complicated, should we modify the task name of event_relation_extraction

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions