当前位置:网站首页>NLP-Reading Comprehension Task Learning Summary Overview
NLP-Reading Comprehension Task Learning Summary Overview
2022-08-09 16:48:00 【wwlsm_zql】
Reading comprehension task general form
- Cloze: Given an article, mask out part of the article, and then let the model fill in according to the context
- The form of reading comprehension is very similar to pre-training, so it is very suitable for pre-training models
- The cloze model can be thought of as maximizing the conditional probability P(a|C-a), where a is the removed content and C is the context
- Usually, the probability distribution output (attention value) is performed on the candidate dictionary, and the maximum value is obtained as the result after normalization
- Limited application scenarios
- Multiple choice: Given an article, a question, and multiple alternative answers, the model selects the correct answer from multiple alternative answers
- Multiple choice model, which can be seen as maximizing probability P(a|C,Q,A), where a is the correct answer, C is the context, Q is the question, and A is the list of candidate answers
- Similar to the two-tower model, one side is the input article and question, the other side is the candidate answer, and the final output probability value is the highest
- There are often no candidate answers in actual scenarios, and the limitations are very large
- Segment extraction: Given an article and a question, let the model extract a continuous sequence from the article as the answer to the question
- Segment extraction task, treat the article as a collection of different sequences, select consecutive subsequences as the answer, and the output is generally the start and end indexes
- The model can be seen as maximizing the probability P(a|C,Q), where C is the context (article), Q is the question, and a is the set of extracted sequences
- Freedom generation: Given an article and a question, let the model generate a sequence to answer the question, not limited to sentences in the article
- The smoothness and correctness of freely generated content is difficult
- Cloze: Given an article, mask out part of the article, and then let the model fill in according to the context
Reading comprehension assessment indicators
- Accuracy: The proportion of correct results in all prediction results
- There are standard answers for cloze and multiple choice, so accuracy is generally used to judge
- EM(Exact Match): The predicted result matches the correct result, and the exact match is counted as 1, the percentage of the calculation
- Commonly used for extractive reading comprehension
- F1: Harmonic mean of precision and recall
- Commonly used for extractive reading comprehension
- BLEU-4: n-gram based evaluation metrics
- Commonly used in generative reading comprehension
- Rouge-L: Quasi-call weighted harmonic mean based on longest common subsequence
- It is often used to evaluate the quality of automatic summarization, where L represents the longest common subsequence
- Commonly used in generative reading comprehension
- Accuracy: The proportion of correct results in all prediction results
General model structure for reading comprehension
- Embedding Layer
- Used to embed the input data
- Common methods include pre-training, word vector technology
- Feature Layer
- This layer is mainly to obtain the context feature representation
- Common lstm, cnn, transformer
- Interaction Layer
- Realize the interaction between paragraph and paragraph, content and question, question and answer
- Common way (one-way/two-way) attention mechanism
- Output layer
- Generates the final result
- Close only needs to output the value with the highest probability distribution or highest score
- Multiple-choice tasks require scoring candidate answers to get the highest-scoring answer
- The extraction method needs to match the sequence combination most relevant to the problem, and output the start and end index
- Generative tasks need to be generated freely, similar to GPT-2
- Embedding Layer
边栏推荐
- spark shuffle
- 《平凡的世界》(书评)
- [Elementary C language] Detailed explanation of branch statements
- 分类任务系列学习——总述
- 什么是跨境电商测评?
- Sort method (Hill, Quick, Heap)
- WebGL探索——抉择:实践方向(twgl.js、Filament、Claygl、BabylonJS、ThreeJS、LayaboxJS、SceneJS、ThinkJS、ThingJS)
- 你知道亚马逊代运营的成本是多少吗?
- 鸡生蛋,蛋生鸡问题。JS顶级对象Function,Object关系
- 抱抱脸(hugging face)教程-中文翻译-模型概要
猜你喜欢
随机推荐
Sort method (Hill, Quick, Heap)
【研究生工作周报】(第十周)
微信小程序转盘demo
At the beginning of the C language order 】 【 o least common multiple of three methods
【论文阅读】LIME:Low-light Image Enhancement via Illumination Map Estimation(笔记最全篇)
Welcome to use CSDN - markdown editor
研究生工作周报(第四周)
Server运维:设置.htaccess按IP和UA禁止访问
抱抱脸(hugging face)教程-中文翻译-模型概要
pyspark dataframe分位数计算
C language operator precedence
链游是什么意思 链游和游戏的区别是什么
YOLOV2详解
stream去重相同属性对象
如何在实际操作中进行亚马逊测评
pyspark jieba 集群模型 对文本进行切词
WebGL:BabylonJS入门——初探:数据展示
Qt control - QTextEdit usage record
ASP.Net Core实战——身份认证(JWT鉴权)
function calling convention









