Does anybody know if there is a specific tokenizer for Twitter?


This is a rather technical question, but I will try to answer it as best I can. As far as I can see, the term "tokenization" is somewhat ambiguous. My guess is that you want to know whether there is a tokenizer that can segment sentences into words. A search on the net shows that there are many different tokenizers for use with Twitter. Here is a link to one of these (Java)
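To give an idea of what such a tokenizer does differently from an ordinary one, here is a minimal sketch in Python using only the standard `re` module. It is not the Java tool mentioned above, just an illustration under my own assumptions: the name `tokenize_tweet` and the pattern list are made up for this example, and the emoticon pattern only covers a handful of common cases. The point is that URLs, @mentions and hashtags are kept as single tokens instead of being split at the punctuation.

```python
import re

# Hypothetical pattern for illustration only. Order matters: earlier
# alternatives win, so URLs, mentions and hashtags are tried before the
# generic word and punctuation patterns.
TOKEN_PATTERN = re.compile(
    r"""
    https?://\S+          # URLs (greedy up to the next whitespace)
    | @\w+                # @mentions
    | \#\w+               # hashtags
    | [:;=][-o]?[()DPp]   # a few common western emoticons
    | \w+(?:'\w+)?        # words, with an optional internal apostrophe
    | [^\w\s]             # any other single non-space symbol
    """,
    re.VERBOSE,
)


def tokenize_tweet(text):
    """Split a tweet into tokens, keeping Twitter-specific items whole."""
    return TOKEN_PATTERN.findall(text)


print(tokenize_tweet("@bob I love #NLP :) http://t.co/abc"))
# prints ['@bob', 'I', 'love', '#NLP', ':)', 'http://t.co/abc']
```

One known weakness of this sketch: a URL directly followed by punctuation (for example a full stop at the end of the tweet) will swallow that punctuation, so a real tokenizer needs more careful URL handling.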

20 October 2014 - 11:08