How to research stuff using Twitter


Key to this method - follow lots of opinionated people.#

I follow many people - more than 600. There are many ML experts. So if I’m researching what methods to use, what algorithms, or just want to learn about a subject I’ll use Twitter first for that.

Let’s say I want to learn how to do “topic modeling” on a dataset made of tweets. I just use search, remembering to check “People you follow

search

I especially liked this tweet:

Without any prior knowledge, just from that one tweet by @pmbaumgartner , you can learn that there is a traditional topic modeling method called “LDA”, and it doesn’t provide good results for tweets. “BERT embeddings/Universal Sentence Encoder → UMAP → HDBSCAN” seem to work better.

But let’s look at the whole discussion.

Treeverse#

I go to the “main” tweet and use a browser plugin called Treeverse

This is the tweet that started the whole discussion.

Treeverse gives me a good overview - Twitter sometimes hides some tweets, so I prefer to use Treeverse.

Treeverse

One other tweet contained tricks on how to improve the performance of modeling:

There was also a response by this guy:

Remember “BERT embeddings/Universal Sentence Encoder → UMAP → HDBSCAN” from the previous tweet?

Leland

Leland happens to be the researcher behind UMAP and HDBSCAN.

Instant “follow” for me.

Ok, so what about LDA? What do other people think about it? I repeated the search, remembering to check “People you follow”.

This time I needed to dig a little deeper, focused on this tweet:

The tree got way bigger (Rachael has 22k followers)

But if you zoom in:

We see familiar faces (@pmbaumgartner , leland_mcinnes )

Leland even provides code examples for his approach.

https://gist.github.com/lmcinnes/fbb63592b3225678390f08e50eda2b61

Twitter is good for finding different approaches/methods, way better than any other forum-like solution.

But the key is following correct people.

And do bias to the side of following more opinionated folks.