Linguist, Ontology (8297532)

We're looking for linguists to develop data infrastructure for Natural Language Processing systems across products. We need people who can articulate their positions and intuitions while seeking common ground with others.

We value expertise in any sub-field of linguistics. Whatever your experience, you should be passionate about your area and arrive ready to describe your favorite research project. You should have experience developing and evaluating theories that capture linguistic generalizations while remaining precise, concise, and implementable.

You are open to programming and data processing techniques; you have written some code and are willing to learn more. You should understand basic programming concepts: you know what a variable is and the purpose of functions. You don’t have to be a wizard, but at a minimum, you can write scripts for manipulating text files.

You know that language data underpins all current language technologies. You love practical language analysis as much as theoretical analysis. You understand basic statistics and experimental design. You bring a unique skill set to the table and are eager to adapt it to a diverse range of products and problems.

The successful candidate will take on the general responsibilities and at least one of the specialized roles listed here.

General Responsibilities

  • Collaboratively develop complex and consistent linguistic analyses
  • Pilot new label sets to test their feasibility
  • Determine quality standards for annotation
  • Annotate language data with linguistic information
  • Curate large amounts of annotated data by analyzing annotations, comparing annotations to guidelines, and building gold analyses

Specialized Roles

  • An Ontology Manager to create and revise hierarchical label sets for natural language based on the needs of the product team
  • A Tools Developer to build tools for data annotation, data storage, and quality evaluation with an eye toward the long term
  • A Data Quality Engineer to monitor data collection, compile statistics, and ensure data quality
  • A Technical Writer to distill complex concepts and group consensus into accessible guidelines
  • A Grammar Developer to write regular expressions and Context-Free Grammars for data validation, named entity recognition, information extraction, etc.
  • A Dialog Generator to conceptualize and create systems for Natural Language Generation
  • A Designer to connect linguistic analyses and system APIs to provide a smooth user experience

Qualifications

  • Academic degree in Linguistics, Computational Linguistics, Language Technologies, or related field
  • Interest and experience in various areas of linguistics, including pragmatics, semantics, syntax, morphology, phonology, phonetics, discourse analysis, sociolinguistics, psycholinguistics, computational linguistics, and fieldwork
  • Ability to formulate, articulate, and describe concepts and hypotheses that are broad enough to capture high-level concepts and specific enough to formalize as rules
  • Collaborative and solution-oriented attitude
  • Eagerness to learn new skills and adapt to a changing environment
  • Strong problem-solving and analytical skills
  • Enthusiasm for detail work and ability to focus for significant portions of the work day
  • Experience with basic programming techniques and familiarity with languages such as Python, PHP, Perl, and C#
  • Ability to speak and write in English fluently and idiomatically

Preferred Qualifications

  • Advanced degree and/or industry experience
  • Fluency in two or more natural languages
  • Familiarity with version control, unit tests, and other programming best practices