Named Entity Recognition

Label words or phrases within text using the Universal Data Tool

Setup the Dataset

Navigate to udt.dev and click "New File"

Then select the Named Entity Recognition button from the Setup > Data Type page.

You can now configure the interface you'd like for you Named Entity Recognition dataset by adding any labels you'd like to display per sample.

Import Data

You can use any of the following methods to import text data.

Label your Data (with friends!)

Go to the Label tab to begin labeling data. See the Collaborative Labeling Guide to label with friends or a team of your labelers.

Export and Use

The easiest way to use a Named Entity Recognition dataset is using the JSON format. Use the "Download JSON" button at the top when you're done labeling and check out the Named Entity Recognition JSON Specification.

Here's what a JSON sample looks like in the resultant dataset:

{
  "document": "This strainer makes a great hat, I'll wear it while I serve spaghetti",
  "annotation": {
    "entities": [
      { text: "strainer", label: "hat", start: 5, end: 12 },
      { text: "spaghetti", label: "food", start: 60, end: 68 }
    ]
  }
}

Last updated