鞍点,saddle point
变换,transform
编码器,encoder
标签,label
步幅,stride
参数,parameter
长短期记忆网络,long short-term memory (LSTM)
超参数,hyperparameter
层序softmax,hierarchical softmax
查准率,precision
成本,cost
词表,vocabulary
词嵌入,word embedding
词向量,word vector
词元,token
词元分析器,tokenizer
词元化,tokenize
汇聚层,pooling layer
稠密,dense
大小,size
导入,import
轮,epoch
暂退法,dropout
动量法,momentum (method)
独立同分布,independent and identically distributed (i.i.d.)
端到端,end-to-end
多层感知机,multilayer perceptron
多头注意力,multi-head attention
二元分类,binary classification
二元,bigram
子采样,subsample
发散,diverge
泛化,generalization
泛化误差,generalization error
方差,variance
分类,classification
分类器,classifier
负采样,negative sampling
感受野,receptive field
格拉姆矩阵,Gram matrix
共现,co-occurrence
广播,broadcast
规范化,normalization
过拟合,overfitting
核回归,kernel regression
恒等映射,identity mapping
假设,hypothesis
基准,baseline
激活函数,activation function
解码器,decoder
近似法,approximate method
经验风险最小化,empirical risk minimization
局部最小值,local minimum
卷积核,convolutional kernel
卷积神经网络,convolutional neural network
决策边界,decision boundary
均值,mean
均方误差,mean squared error
均匀采样,uniform sampling
块,block
困惑度,perplexity
拉普拉斯平滑,Laplace smoothing
连结,concatenate
类,class
交叉熵,cross-entropy
连续词袋,continuous bag-of-words (CBOW)
零张量,zero tensor
流水线,pipeline
滤波器,filter
门控循环单元,gated recurrent unit (GRU)
目标检测,object detection
偏置,bias
偏导数,partial derivative
偏移量,offset
批量,batch
齐普夫定律,Zipf's law
欠拟合,underfitting
情感分析,sentiment analysis
全连接层,fully connected layer
权重,weight
三元,trigram
上采样,upsample
上下文变量,context variable
上下文窗口,context window
上下文词,context word
上下文向量,context vector
实例/示例,instance
收敛,converge
属性,property
数值方法,numerical method
数据集,dataset
数据示例,data instance
数据样例,data example
顺序分区,sequential partitioning
softmax回归,softmax regression
随机采样,random sampling
损失函数,loss function
双向循环神经网络,bidirectional recurrent neural network
特征,feature
特征图,feature map
特征值,eigenvalue
梯度,gradient
梯度裁剪,gradient clipping
梯度消失,vanishing gradients
填充,padding
跳元模型,skip-gram model
调参,tune hyperparameters
停用词,stop words
通道,channel
凸优化,convex optimization
图像,image
未知词元,unknown token
无偏估计,unbiased estimate
误差,error
小批量,minibatch
小批量梯度,minibatch gradient
线性模型,linear model
线性回归,linear regression
协同过滤,collaborative filtering
学习率,learning rate
训练误差,training error
循环神经网络,recurrent neural network (RNN)
样例,example
一维梯度下降,gradient descent in one-dimensional space
一元,unigram
隐藏变量,hidden variable
隐藏层,hidden layer
优化器,optimizer
语料库,corpus
运算符,operator
自注意力,self-attention
真实值,ground truth
指标,metric
支持向量机,support vector machine
注意力机制,attention mechanism
注意力模型,attention model
注意力提示,attention cue
准确率/精度,accuracy