Example of aligning your annotation to the original text

This example shows how you can use PubAnnotation to align your annotation into the original text.

Suppose you once took a sentence from the PubMed abstract 25314077 and annotated it for gene names using Entrez Gene IDs as follows:

{"text":"Here, we identify the interaction between the NF-kappaB-regulated antiapoptotic factor GADD45beta and the JNK kinase MKK7 as a therapeutic target in MM.","denotations":[{"id":"T1","span":{"begin":46,"end":55},"obj":"Gene:4790"},{"id":"T2","span":{"begin":87,"end":97},"obj":"Gene:4616"},{"id":"T3","span":{"begin":117,"end":121},"obj":"Gene:5609"},{"id":"T4","span":{"begin":106,"end":109},"obj":"Gene:5599"}]}

Its JSON encoding may look like as follows (indented for easy of reading):

{
"text":"Here, we identify the interaction between the NF-kappaB-regulated antiapoptotic factor GADD45beta and the JNK kinase MKK7 as a therapeutic target in MM.",
"denotations":[
	{"id":"T1","span":{"begin":46,"end":55},"obj":"Gene:4790"},
	{"id":"T2","span":{"begin":87,"end":97},"obj":"Gene:4616"},
	{"id":"T3","span":{"begin":117,"end":121},"obj":"Gene:5609"},
	{"id":"T4","span":{"begin":106,"end":109},"obj":"Gene:5599"}
]
}

Suppose you’ve stored the annotation in the json file, PubMed-25314077-gene.json. You can open the file in the TextAE editor.

Now you want to examine the whole abstract, and for tht you want to bring the annotation into the source abstract. To do it, you first need to find the location of the sentence in the abstract, then update the position indice (begin and end) of the annotation. Below is the abstract as extracted from PubMed. Note that the sentence is highlighted for convenience of reading:

PubMed:25314077 Cancer-selective targeting of the NF-κB survival pathway with GADD45β/MKK7 inhibitors. Constitutive NF-κB signaling promotes survival in multiple myeloma (MM) and other cancers; however, current NF-κB-targeting strategies lack cancer cell specificity. Here, we identify the interaction between the NF-κB-regulated antiapoptotic factor GADD45β and the JNK kinase MKK7 as a therapeutic target in MM. Using a drug-discovery strategy, we developed DTP3, a D-tripeptide, which disrupts the GADD45β/MKK7 complex, kills MM cells effectively, and, importantly, lacks toxicity to normal cells. DTP3 has similar anticancer potency to the clinical standard, bortezomib, but more than 100-fold higher cancer cell specificity in vitro. Notably, DTP3 ablates myeloma xenografts in mice with no apparent side effects at the effective doses. Hence, cancer-selective targeting of the NF-κB pathway is possible and, at least for myeloma patients, promises a profound benefit.

With PubAnnotation, the task is very easy to accomplish. You can just send the json file to PubAnnotation as follows:

The above command uses the cURL tool

  • to send (by the HTTP POST method) the file, PubMed-25314077-gene.json,
  • to the URL, https://pubannotation.org/docs/sourcedb/PubMed/sourceid/25314077/annotations.json,
  • then stores the output to the file, PubMed-25314077-gene-aligned.json

Below is the result of the cURL command:

PubMed:25314077 {"text":"Cancer-selective targeting of the NF-κB survival pathway with GADD45β/MKK7 inhibitors.\nConstitutive NF-κB signaling promotes survival in multiple myeloma (MM) and other cancers; however, current NF-κB-targeting strategies lack cancer cell specificity. Here, we identify the interaction between the NF-κB-regulated antiapoptotic factor GADD45β and the JNK kinase MKK7 as a therapeutic target in MM. Using a drug-discovery strategy, we developed DTP3, a D-tripeptide, which disrupts the GADD45β/MKK7 complex, kills MM cells effectively, and, importantly, lacks toxicity to normal cells. DTP3 has similar anticancer potency to the clinical standard, bortezomib, but more than 100-fold higher cancer cell specificity in vitro. Notably, DTP3 ablates myeloma xenografts in mice with no apparent side effects at the effective doses. Hence, cancer-selective targeting of the NF-κB pathway is possible and, at least for myeloma patients, promises a profound benefit.","denotations":[{"id":"T1","span":{"begin":298,"end":303},"obj":"Gene:4790"},{"id":"T2","span":{"begin":335,"end":342},"obj":"Gene:4616"},{"id":"T3","span":{"begin":362,"end":366},"obj":"Gene:5609"},{"id":"T4","span":{"begin":351,"end":354},"obj":"Gene:5599"}]}

Below is the TextAE rendering of it:

{"text":"Cancer-selective targeting of the NF-\u03baB survival pathway with GADD45\u03b2/MKK7 inhibitors.\nConstitutive NF-\u03baB signaling promotes survival in multiple myeloma (MM) and other cancers; however, current NF-\u03baB-targeting strategies lack cancer cell specificity. Here, we identify the interaction between the NF-\u03baB-regulated antiapoptotic factor GADD45\u03b2 and the JNK kinase MKK7 as a therapeutic target in MM. Using a drug-discovery strategy, we developed DTP3, a D-tripeptide, which disrupts the GADD45\u03b2/MKK7 complex, kills MM cells effectively, and, importantly, lacks toxicity to normal cells. DTP3 has similar anticancer potency to the clinical standard, bortezomib, but more than 100-fold higher cancer cell specificity in vitro. Notably, DTP3 ablates myeloma xenografts in mice with no apparent side effects at the effective doses. Hence, cancer-selective targeting of the NF-\u03baB pathway is possible and, at least for myeloma patients, promises a profound benefit.","denotations":[{"id":"T6","span":{"begin":298,"end":303},"obj":"Gene:4790"},{"id":"T7","span":{"begin":335,"end":342},"obj":"Gene:4616"},{"id":"T9","span":{"begin":362,"end":366},"obj":"Gene:5609"},{"id":"T8","span":{"begin":351,"end":354},"obj":"Gene:5599"}]}