py-umls package

This package contains three modules with classes useful for dealing with RxNorm, then a module each for UMLS and SNOMED handling.

rxnorm

Provides classes that deal with RxNorm. This is very much WiP!

class rxnorm.RxNorm

Bases: object

A class for handling RxNorm in an SQLite database and performing a handful of RxNorm-related tasks.

classmethod check_database()

Check if our database is in place and if not, import them. Will raise on errors!

RxNorm: (rxnorm.db) If missing prompt to use the rxnorm.sh script

classmethod ndc_normalize(ndc)

Normalizes an NDC (National Drug Code) number.

The pseudo-code published by NIH (http://www.nlm.nih.gov/research/umls/rxnorm/NDC_Normalization_Code.rtf) first identifies the format (e.g. “6-3-2”) and then normalizes based on that finding. However since the normalized string is always 5-4-2, padded with leading zeroes and removing all dashes afterwards, this implementation goes a much simpler route.

NDCs that only contain one dash are treated as if they were missing the package specifier, so they get a “-00” appended before normalization.

Parameters:ndc (str) – The NDC to normalize as string
Returns:A string with the normalized NDC, or None if the number couldn’t be normalized
class rxnorm.RxNormCUI(rxcui, label=None)

Bases: graphable.GraphableObject

deliver_to(dot_context, is_leaf)
find_relations(to_rxcui=None, max_width=10)
relations = None
rxcui = None
rxlookup = <rxnorm.RxNormLookup object at 0x10d839550>
ttys
update_self_from_rxcui()
update_shape_from_ttys()
class rxnorm.RxNormConceptRelation(rxcuiobj1, rela, rxcuiobj2)

Bases: graphable.GraphableRelation

rxcui1 = None
rxcui2 = None
class rxnorm.RxNormLookup

Bases: object

Class for RxNorm lookup.

cache_drug_class = False
can_cache()
execute(sql, params=())

Execute and return the pointer of an SQLite execute() query.

fetchAll(sql, params=())

Execute and return the result of fetchall() on a raw SQL query.

fetchOne(sql, params=())

Execute and return the result of fetchone() on a raw SQL query.

friendly_class_format(va_name)

Tries to reformat the VA drug class name so it’s suitable for display.

Returns a set of tuples containing the RXCUI and the actual relation for the desired relation, or all if the relation is not specified.

Parameters:
  • rxcui (str) – The RXCUI for which to look up relations
  • relation (str) – Optional: the type of the relation, e.g. “has_ingredient”
  • to_rxcui (str) – An optional second rxcui, to return all relations between the two given rxcuis. Ignored if relation is present.
Returns:

A set of tuples, where tuples are (rxcui, rela)

lookup_rxcui(rxcui, preferred=True)

Return a tuple with (str, tty, rxcui, rxaui) or - if “preferred” is False - a tuple with (preferred-name, list-of-tuples)

lookup_rxcui_name(rxcui, preferred=True, no_html=True)

Return a string or HTML for the meaning of the given code. If preferred is True (the default), only one match will be returned, looking for specific TTY and using the “best” one. There is currently NO SUPPORT FOR preferred = False

lookup_tty(rxcui)

Returns a set of TTYs for the given RXCUI.

ndc_for_rxcui(rxcui)

Find the NDC from our NDC-cache-table for the given RXCUI.

prepare_to_cache_classes()
rxcui_for_name(name, limit_tty=None)

Tries to find an RXCUI for the concept name.

Does this by performing a “starts with” against the STR column on RXNCONSO, then replaces any spaces with wildcards and finally chops off one word after the other until a match is found.

This works but is slow and far from perfect. RxNav’s approxMatch is definitely better, you can use rxcui_for_name_approx to get an RXCUI using that service.

Parameters:
  • name (str) – The name to get an RXCUI for
  • limit_tty (list) – Optional: limit search to a given list of TTYs
Returns:

The best matching rxcui, if any, as string

rxcui_for_name_approx(name)

Returns the best approxMatch RXCUI as found when using RxNav’s service against the provided name. Runs synchronously.

Parameters:name (str) – The name to get an RXCUI for
Returns:The top ranked rxcui, if any, as string
rxcui_for_ndc(ndc)

Find the RXCUI for the given NDC from our NDC-cache-table.

This method only does exact lookup for now, it should be extended to use normalized NDC formats.

Parameters:ndc (str) – The NDC to look up
Returns:The matching RXCUI as string, or None
rxnav_approx_match(name, nmax=10)

Returns the top #nmax approximateTerm rxcuis as found when using RxNav’s service against the provided name. Runs synchronously.

Parameters:
  • name (str) – The name to get an RXCUI for
  • nmax (int) – The maximum number of unique rxcuis to return, 10 by default
Returns:

The top ranked rxcuis, if any, as a list

sqlite = None
va_drug_class(rxcui)

Returns a list of VA class names for a given RXCUI. EXPERIMENTAL.

rxnorm_graph

A useful script to help visualize relationships between RxNorm concepts, starting from a given RXCUI. Just run this script in your command line and follow the leader.

umls

Module to deal with UMLS lexica.

class umls.UMLS

Bases: object

A class for importing UMLS terminologies into an SQLite database.

classmethod check_database()

Check if our database is in place and if not, prompts to import it. Will raise on errors!

UMLS: (umls.db) If missing prompt to use the umls.sh script

class umls.UMLSLookup

Bases: object

UMLS lookup

did_check_dbs = False
lookup_code(cui, preferred=True)

Return a list with triples that contain: - name - source - semantic type by looking it up in our “descriptions” database. The “preferred” settings has the effect that only names from SNOMED (SNOMEDCD) and the Metathesaurus (MTH) will be reported. A lookup in our “descriptions” table is much faster than combing through the full MRCONSO table.

Returns:A list of triples with (name, sab, sty)
lookup_code_for_name(name, preferred=True)

Tries to find a good concept code for the given concept name.

Uses our indexed descriptions table.

Returns:A list of triples with (cui, sab, sty)
lookup_code_meaning(cui, preferred=True, no_html=True)

Return a string (an empty string if the cui is null or not found) by looking it up in our “descriptions” database. The “preferred” settings has the effect that only names from SNOMED (SNOMEDCD) and the Metathesaurus (MTH) will be reported. A lookup in our “descriptions” table is much faster than combing through the full MRCONSO table.

preferred_sources = ['"SNOMEDCT"', '"MTH"']
sqlite = None

snomed

Module to deal with the SNOMED terminology.

class snomed.SNOMED

Bases: object

A class for importing UMLS terminologies into an SQLite database.

classmethod check_database()

Check if our database is in place and if not, prompts to create it. Will raise on errors!

SNOMED: (snomed.db) Read SNOMED CT from tab-separated files and create an SQLite database.

classmethod did_import(table_name)

Allows us to set hooks after tables have been imported

classmethod find_needed_files(snomed_dir)
classmethod import_csv_into_table(snomed_file, table_name)

Import SNOMED CSV into our SQLite database. The SNOMED CSV files can be parsed by Python’s CSV parser with the “excel-tab” flavor.

classmethod import_from_files(rx_map)
classmethod insert_query_for(table_name)

Returns the insert query needed for the given table

classmethod insert_tuple_from_csv_row_for(table_name, row)
classmethod setup_tables()

Creates the SQLite tables we need, not the tables we deserve. Does nothing if the tables/indexes already exist

sqlite_handle = None
class snomed.SNOMEDLookup

Bases: object

SNOMED lookup

lookup_code_meaning(snomed_id, preferred=True, no_html=True)

Returns HTML for all matches of the given SNOMED id. The “preferred” flag here currently has no function.

sqlite = None

graphable

Provides classes that can be used to create an interdependency graph.

class graphable.DotContext(max_depth=None, max_width=None)

Bases: object

announce(obj)
deliver(obj)
depth = 0
get()
items = None
max_depth = 8
max_width = 15
source = None
class graphable.GraphableObject(name, label=None)

Bases: object

announce_to(dot_context, via=None)

Announce the receiver to the context.

Subclasses MUST NOT announce other graphable objects they are holding on to here but they MUST announce them in “deliver_to” if appropriate.

  • dot_context The context to announce to

  • via If not-None the other GraphableObject that is responsible for

    announcing the receiver

announced_via = None
color = None
deliver_to(dot_context, is_leaf)

Call the context’s “deliver” method.

This method is guaranteed to only be called once per context. Hence subclasses that hold on to other graphable objects MUST ANNOUNCE those instances here (but NOT deliver them) but ONLY IF “is_leaf” is not True. - dot_context The context to deliver to - is_leaf If True means the receiver is intended to be a leaf object

dot_representation()
inner_dot()
label = None
name
shape = None
style = None
class graphable.GraphableRelation(rel_from, label, rel_to)

Bases: graphable.GraphableObject

deliver_to(dot_context, is_leaf)
dot_representation()
relation_from = None
relation_to = None
class graphable.GraphvizGraphic(out_file='rxgraph.png')

Bases: object

cmd = 'dot'
executableCommand(infile)
max_depth = None
max_width = None
out_dot = None
out_file = None
out_type = 'pdf'
write_dot_graph(obj)

sqlite

Our SQLite connection class.

class sqlite.CSVImporter(csv_path, tablename='rows')

Bases: object

A simple CSV to SQLite importer class.

Expects a CSV file with a header row, will create a table reflecting the header row and import all rows.

import_to(dbpath, csv_format='excel')
sqlite_handle(dbpath)
class sqlite.SQLite(database=None)

Bases: object

SQLite access

close()
commit()
connect()
create(table_name, table_structure)

Executes a CREATE TABLE IF NOT EXISTS query with the given structure. Input is NOT sanitized, watch it!

execute(sql, params=())

Executes an SQL command and returns the cursor.execute, which can be used as an iterator. Supply the params as tuple, i.e. (param,) and (param1, param2, ...)

executeInsert(sql, params=())

Executes an SQL command (should be INSERT OR REPLACE) and returns the last row id, 0 on failure.

executeOne(sql, params)

Returns the first row returned by executing the command

executeUpdate(sql, params=())

Executes an SQL command (should be UPDATE) and returns the number of affected rows.

classmethod get(database)

Use this to get SQLite instances for a given database. Avoids creating multiple instances for the same database.

We keep instances around per thread per database, maybe there should be a way to turn this off. However, here we always release instances for threads that are no longer alive. If this is better than just always creating a new instance should be tested.

hasTable(table_name)

Returns whether the given table exists.

rollback()

Table Of Contents

Previous topic

py-umls

This Page