Home:ALL Converter>Fasttext document embedding code in python

Fasttext document embedding code in python

Ask Time:2017-07-06T02:01:21         Author:ayush singhal

Json Formatter

Currently Fasttext is producing sentence vectors by taking average of the normalized word vectors of words in the sentence. Is it the best way to come up with the sentence vector.

Or whether using a tfidf weighting of words and then subtracting the first PCA component from it as discussed in this paper: https://openreview.net/pdf?id=SyK00v5xx will work better. And is there any such implementation already there within fasttext. If so where and how to use it through python.

Also, do I need to remove stop words from the sentence when I compute its fasttext vector?

Also in the python binding of fasttext how are to compute the sentence vector. There seem to be no syntax for that. Any comments.

Author:ayush singhal,eproduced under the CC 4.0 BY-SA copyright license with a link to the original source and this disclaimer.
Link to original article:https://stackoverflow.com/questions/44933073/fasttext-document-embedding-code-in-python
yy