reduce-umls-terms

Reduce UMLS Terms

In UMLS, each concept unique identifier (CUI) maps to many terms, including translations and spelling variations. Many of these variations are near-duplicates that differ only in capitalization or in added symbols (such as a semicolon).

This module removes duplicate entries from a list of UMLS terms.
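
To make the idea concrete, here is a minimal sketch of the kind of normalization that makes such variants compare equal. It is an illustration only, not the package's actual implementation: `normalizeTerm` and `dedupeTerms` are hypothetical names, and the exact rules (lowercasing, stripping diacritics, treating symbols as spaces) are assumptions based on the example output shown under "How to".

```javascript
// Hypothetical helper, not part of the reduce-umls-terms API.
function normalizeTerm(term) {
  return term
    .toLowerCase()
    .normalize('NFD')                 // split "ï" into "i" plus a combining mark
    .replace(/[\u0300-\u036f]/g, '')  // drop the combining marks
    .replace(/[^a-z0-9\s]/g, ' ')     // treat symbols such as ";" and "," as spaces
    .replace(/\s+/g, ' ')             // collapse repeated whitespace
    .trim();
}

// Hypothetical helper: normalize every term and keep the first occurrence of each.
function dedupeTerms(terms) {
  return [...new Set(terms.map(normalizeTerm))];
}

const sample = ['Reumatoïde artritis', 'reumatoïde artritis', 'artritis; reumatoïd', 'RA'];
console.log(dedupeTerms(sample));
// [ 'reumatoide artritis', 'artritis reumatoid', 'ra' ]
```

This sketch only handles Latin-script terms, since everything outside `a-z` and `0-9` is treated as a separator after the diacritics are removed.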

Installation

Use npm:

$ npm install reduce-umls-terms

How to

var reducer = require('reduce-umls-terms');

var entry = [
    "Reumatoïde artritis",
    "Artritis, reumatoïde",
    "artritis; atrofisch",
    "artritis; reumatoïd",
    "atrofisch; artritis",
    "reumatoïd; artritis",
    "Reumatoïde artritis, niet gespecificeerd",
    "RA",
    "atrofische artritis",
    "reumatische artritis",
    "reumatoïde artritis",
    "reumatoïde artritis",
    "reumatoïde artritis",
    "reumatoïde artritis",
    "systemische reumatoïde artritis"
];

var terms = reducer.reduce(entry);
terms = reducer.remove_container_duplicates(terms);

/* Gives
[
    'reumatoide artritis',
    'ra',
    'atrofische artritis',
    'reumatische artritis',
    'artritis reumatoide',
    'artritis atrofisch',
    'artritis reumatoid',
    'atrofisch artritis',
    'reumatoid artritis'
]
*/

// The list can be reduced further if word order does not matter
// (for example, "reumatoid artritis" == "artritis reumatoid")

terms = reducer.remove_order_duplicates(terms);

/* Now gives
[
  'ra',
  'reumatoide artritis',
  'atrofische artritis',
  'reumatische artritis',
  'artritis atrofisch',
  'artritis reumatoid'
]
*/
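
For readers curious how order-insensitive matching can work, the following is a rough sketch under stated assumptions, not the code behind `remove_order_duplicates`. The function name `removeOrderDuplicatesSketch` is hypothetical, and the input is assumed to be already lowercased and free of symbols. Two terms are treated as duplicates when they consist of the same words in any order.

```javascript
// Hypothetical helper, not part of the reduce-umls-terms API.
function removeOrderDuplicatesSketch(terms) {
  const seen = new Set();
  const kept = [];
  for (const term of terms) {
    // Sorting the words yields the same key for every ordering of them.
    const key = term.split(/\s+/).sort().join(' ');
    if (!seen.has(key)) {
      seen.add(key);
      kept.push(term); // keep the first spelling encountered
    }
  }
  return kept;
}

const normalized = [
  'reumatoide artritis',
  'ra',
  'artritis reumatoide',
  'artritis atrofisch',
  'atrofisch artritis'
];
console.log(removeOrderDuplicatesSketch(normalized));
// [ 'reumatoide artritis', 'ra', 'artritis atrofisch' ]
```

The sketch preserves input order and keeps the first variant it sees, so its result may be ordered differently from what the package returns.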

Version

0.0.5

License

MIT

Collaborators

  • tessmore