I was describing word embeddings to a friend of mine, and he enthusiastically asked me to try them on his website, which contains thousands of links (mostly to HTML and PDF documents).
Does anyone have a handy tool for going from a single URL to a concatenation of the text of all the linked documents?
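
To make the question concrete, here is a rough sketch of what I have in mind, using only the Python standard library. It is a sketch under assumptions, not a finished tool: the names `parse_html`, `fetch_html` and `linked_text` are my own, it only follows links one level deep from the starting page, and it skips anything that is not HTML (PDFs would need a separate, probably third-party, text extractor that I have not included).

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag
from urllib.request import urlopen


class _LinkAndTextParser(HTMLParser):
    """Collects href targets and visible text from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text_parts = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def parse_html(html):
    """Return (list of raw hrefs, visible text) for one HTML string."""
    parser = _LinkAndTextParser()
    parser.feed(html)
    parser.close()
    return parser.links, " ".join(parser.text_parts)


def fetch_html(url):
    """Assumed default fetcher: HTML text, or None for non-HTML (e.g. PDF)."""
    with urlopen(url, timeout=10) as resp:
        if "html" not in resp.headers.get_content_type():
            return None
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


def linked_text(start_url, fetch=fetch_html):
    """Concatenate the text of every page linked directly from start_url."""
    start_html = fetch(start_url)
    if start_html is None:
        return ""
    links, _ = parse_html(start_html)
    seen, texts = set(), []
    for href in links:
        # Resolve relative links and drop "#fragment" so duplicates collapse.
        url = urldefrag(urljoin(start_url, href)).url
        if url in seen or not url.startswith(("http://", "https://")):
            continue
        seen.add(url)
        try:
            html = fetch(url)
        except OSError:
            continue  # skip dead or unreachable links
        if html is not None:
            texts.append(parse_html(html)[1])
    return "\n\n".join(texts)
```

Usage would be something like `corpus = linked_text("https://example.com/")`, with the URL being a placeholder. The `fetch` argument is there so the crawler can be tried on canned pages without touching the network. With thousands of links, a real tool would also want rate limiting and a check of robots.txt, which this leaves out.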