Gradient descent gives one way of minimizing J. A second way of doing so, this time performing the minimization explicitly and without resorting to an ...
So far, we have been talking about discriminative models, which map input features x to labels y and approximate P(y|x) - Bayes' rule. Generative models do ...
I have been working through Andrew Ng's Deep Learning Specialization on Coursera. I have completed the first of the five courses in the specialization ...
I decided to go through some of the breakthrough papers in the field of NLP (Natural Language Processing) and summarize what I learned. The papers date from ...
I recently attended a talk by Kevin Clark (CS224n) on future trends in NLP. I am writing this post to summarize and discuss the ...
Hello! I have been enrolled at Stanford and have been taking their courses online. Here are my two cents on the ones I have taken so far. CS224n - Natural ...
I have been asked a lot of questions lately about Stanford's online course offerings and why somebody would choose them over the myriad options available online. ...
Hi everyone! There are a ton of language models out there today, many of which have their own unique way of learning "self-supervised" language representations ...