INDEX 771
MLM, 550
Modality, 582
Model Compression, 452
Model Parameters, 305
Model Score, 62
Model Training, 211
Momentum, 312
Multi-branch, 504
Multi-head Attention, 413
Multi-hop Attention, 388
Multi-lingual Single Model-based Method, 561
Multi-modal Machine Translation, 582
Multi-stage Inference, 469
Multi-step Attention, 388
Multimodality Problem, 483
Multitask Learning, 531
Named Entity, 79
Named Entity Recognition, 79
Natural Language Processing, 35
Nesterov Accelerated Gradient, 393
Nesterov Accelerated Gradient Descent, 393
Neural Architecture Search, 533
Neural Language Model, 325
Neural Machine Translation, 337
Neural Networks, 275
Noisy Channel Model, 153
Non-autoregressive Model, 374
Non-autoregressive Translation, 481
Non-terminal, 90
Norm, 285
Numerical Differentiation, 309
Objective Function, 305
Offline Speech Translation, 583
One-hot Encoding, 330
Open Vocabulary, 428
Optimal Stopping Criteria, 67
Out-of-vocabulary Word, 52
Over Translation, 470
Overfitting, 318
Padding Mask, 414
Parameter, 51
Parameter Estimation, 45
Parameter Server, 316
Paraphrase Matcher, 114
Paraphrasing, 546
Parent Model, 560
Parsing, 72
Perplexity, 57
Phrasal Segmentation, 195
Phrase Extraction, 202
Phrase Pairs, 196
Phrase Structure Parsing, 90
Phrase Table, 205
Physical Deficiency, 188
Piecewise Constant Decay, 371
Pivot Language, 558
Pointwise Convolution, 395
Policy Gradient, 448
Porter Stem Model, 112
Position Embedding, 388
Position-independent Word Error Rate, 108
Post-editing, 613
Post-norm, 417
Post-processing, 73
Pre-emphasis, 584
Pre-norm, 417
Pre-processing, 73
Pre-terminal, 90
Pre-training, 548
Precision, 110
Prediction, 58
Probabilistic Context-free Grammar, 95
Probabilistic Graphical Model, 81