Anomaly detection in text

Here you find the details for the internship named "Anomaly detection in text" in the company ML6.

Name: Anomaly detection in text
Company: ML6

In recent times anomaly detection has become a major topic within the field of AI. It has particularly gained traction within the domain of computer vision with use cases in defect detection, predictive maintenance, etc. and within the domain of structured data with use cases in fraud detection, spam filtering, etc. However, the development of similar techniques with the domain of NLP remains an understudied subject despite its potential.

The goal of this project is to leverage different NLP techniques to arrive at an algorithm that can accurately highlight anomalous words and/or sentences in a document that you wouldn’t expect to appear in that document. This system would likely exploit the repetitive nature of certain types of documents (e.g, rental contracts are 90% the same because they need a certain legal structure). If successful, such an algorithm could have very impactful use cases in fields such as the legal domain, insurance companies, etc. The concrete approach would be to develop such a system on legal documents but we are open to suggestions if there is another field that interests you more where it could also have a big impact.

Target profiles:
    In industries:
      Required special knowledge:

      -Strong analytical abilities, knowledge of different statistical methods, not scared by mathematics and a familiarity with research studies.
      -Strong interest in Computer Vision / NLP / Other subdomain [preferred]
      -Familiarity with statistical analysis languages and tools like Python, SQL.
      -Excellent verbal and written communication in English.
      -You are currently pursuing a degree in computer science or related field.

      Duration: min 8 weeks
      Paid: Nee
      Net wage: -
      Foreign: Nee
      Contact: Julie Plusquin (Talent Partner)