General course chat

Hi all

Have started working through fast.ai and I love it.@jeremy, is the best teacher I’ve found thus far as far as AI/machine learning goes. I’m up to course 5 but now wondering…

How, if at all, can one do text similarity with fast.ai to answer questions like: did this person write this text? (Answer = "XX% confident they did/did not)

In other words, be a text similarity / plagiarism / author detection application. Reason I ask is because I’m curious to see whether fast.ai can provide a fresh look at answering “Who is Satoshi Nakamoto?” question and I plan on building a tutorial jupyter notebook so that others can debate it, improve it, etc.

Thus far I’ve found:

  1. News headlines claiming someone isn’t satoshi but failing to release the papers/work (can’t post 3 links as I’m new but if you google “ibtimes satoshi text analysis” it’s the 1st result)

  2. Chon’s work which used support vector machines, random forests etc but didn’t include all parties, such as the very controversial Australian computer scientist Craig Wright – see https://towardsdatascience.com/stylometric-analysis-satoshi-nakamoto-294926cdf995

  3. By searching this forum, a trump post – Medium: Great NLP application article (who authored Trump tweets?) – which also used random forests, gradient boosting, etc

Could fast.ai have a better tool for the job?

If you have any advice, tips or links to any methods or tutorials, please share because I’d love to work on this on my nights and weekends. I’ll also share my results with the community.

Thanks :slight_smile:

1 Like