Sr. Data Scientist - Natural Language Processing - Computational Linguistics - U.S. (any major city) - travel required

10Roof is currently seeking Data Scientists and NLP Research Scientists for exciting Big 4 consulting opportunities. This is an opportunity to join the innovation division of one of the most respected consulting firms in the United States as they seek to further their capabilities within natural language processing, computational linguistics, machine learning, and artificial intelligence. This is a high level consulting role requiring a Masters degree or PhD from an accredited college/university in Computer Science, Computational Linguistics, Statistics, Mathematics, Engineering, Bioinformatics, Physics, Operations Research, or related fields (strong mathematical/statistics background with ability to understand algorithms and methods from a mathematical viewpoint and an intuitive viewpoint).

In this role you will be expected to:

  • Utilize statistical natural language processing to mine unstructured data, and create insights; analyze and model structured data using advanced statistical methods and implement algorithms and software needed to perform analyses
  • Build document clustering, topic analysis, text classification, named entity recognition, sentiment analysis, and part-of-speech tagging methods for unstructured and semi-structured data
  • Cluster and analyze large amounts of user generated content and process data in large-scale environments using Amazon EC2, Storm, Hadoop and Spark
  • Develop and perform text classification using methods such as logistic regression, decision trees, support vector machines and maximum entropy classifiers
  • Develop methods to support and drive client engagements focused on Big Data and Advanced Business Analytics, in diverse domains such as product development, marketing research, public policy, optimization, and risk management; communicate results and educate others through reports and presentations
  • Perform text mining, generate and test working hypotheses, prepare and analyze historical data and identify patterns


  • Four years of professional experience working in Natural Language Processing or related field
  • Experience with command-line scripting, data structures and algorithms and ability to work in a Linux environment, processing large amounts of data in a cloud environment
  • Strong data extraction and processing, using MapReduce, Pig, and/or Hive preferred
  • Applicants must be currently authorized to work in the United States without the need for visa sponsorship now or in the future