Anomaly detection in text documents

Are there any natural language processing algorithms known to work well for identifying anomalies?

I am thinking about legal documents, NDAs in particular, and would like to detect whether uncommon or less secure clauses were added to a document.

Based on the coursework, it doesn’t seem that such an algorithm was taught or implied. One way I could see this being done is to try to account for every clause and then check whether any odd clauses are left unaccounted for.
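To make the "account for every clause" idea concrete, here is a rough sketch, not a tested method. It assumes the document has already been split into clauses and that a small library of known, acceptable clauses exists. The names `KNOWN`, `DOC`, `flag_unaccounted` and the 0.5 threshold are all made up for illustration. Each clause is compared against the library with plain bag-of-words cosine similarity, and anything that does not resemble a known clause is flagged for review.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def flag_unaccounted(clauses, known_clauses, threshold=0.5):
    """Return (clause, best_score) for clauses unlike every known clause.

    The threshold is an arbitrary illustrative value, not a tuned one.
    """
    known_vecs = [Counter(tokenize(k)) for k in known_clauses]
    flagged = []
    for clause in clauses:
        vec = Counter(tokenize(clause))
        best = max((cosine(vec, k) for k in known_vecs), default=0.0)
        if best < threshold:
            flagged.append((clause, round(best, 2)))
    return flagged


# Hypothetical clause library and document, for illustration only.
KNOWN = [
    "The receiving party shall keep all confidential information secret.",
    "This agreement is governed by the laws of the stated jurisdiction.",
]
DOC = [
    "The receiving party shall keep all confidential information strictly secret.",
    "The disclosing party may share the information with any third party at will.",
]

flagged = flag_unaccounted(DOC, KNOWN)
for clause, score in flagged:
    print(f"{score:.2f}  {clause}")
```

Raw word counts are a crude measure, since common words like "the" inflate similarity. TF-IDF weighting or sentence embeddings would probably be a better fit for real contracts, and the threshold would need tuning on actual NDAs.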

Were you able to find anything?

Hi, unfortunately this project did not go ahead in the end, so I stopped working on it.

Still, I believe it is an interesting topic, and I would be happy to exchange ideas.