태그
paper review
multi-modal
Paper Review Code
CS224N
transformer
uniter
lxmert
muti-modal
visualbert
Lecture 5
iPET
Lecture 3
Log Likelihood
Paper with code
LLaVA
Multimodal Learning
Paper Reivew
mPLUG
Negative log likelihood
GPT-1
GPT-3
regularization
seq2seq
GPT-2
object detection
r-cnn
chain rule
back propagation
stochastic gradient descent
loss function
Gradient descent
Likelihood
bert
Roberta
ELMO
optimization
NLP
L2
MLE
l1
ELECTRA