Data and Text Processing for Health and Life Sciences.

Saved in:
Bibliographic Details
Superior document:Advances in Experimental Medicine and Biology Series ; v.1137
:
Place / Publishing House:Cham : : Springer International Publishing AG,, 2019.
Ã2019.
Year of Publication:2019
Edition:1st ed.
Language:English
Series:Advances in Experimental Medicine and Biology Series
Online Access:
Physical Description:1 online resource (107 pages)
Tags: Add Tag
No Tags, Be the first to tag this record!
id 5005788423
ctrlnum (MiAaPQ)5005788423
(Au-PeEL)EBL5788423
(OCoLC)1106161463
collection bib_alma
record_format marc
spelling Couto, Francisco M.
Data and Text Processing for Health and Life Sciences.
1st ed.
Cham : Springer International Publishing AG, 2019.
Ã2019.
1 online resource (107 pages)
text txt rdacontent
computer c rdamedia
online resource cr rdacarrier
Advances in Experimental Medicine and Biology Series ; v.1137
Intro -- Preface -- Acknowledgments -- Contents -- Acronyms -- 1 Introduction -- Biomedical Data Repositories -- Scientific Text -- Amount of Text -- Ambiguity and Contextualization -- Biomedical Ontologies -- Programming Skills -- Why This Book? -- Third-Party Solutions -- Simple Pipelines -- How This Book Helps Health and Life Specialists? -- Shell Scripting -- Text Files -- Relational Databases -- What Is in the Book? -- Command Line Tools -- Pipelines -- Regular Expressions -- Semantics -- 2 Resources -- Biomedical Text -- What? -- Where? -- How? -- Semantics -- What? -- Languages -- Formality -- Gold Related Documents -- Where? -- OBO Ontologies -- Popular Controlled Vocabularies -- How? -- OWL -- URI -- Further Reading -- 3 Data Retrieval -- Caffeine Example -- Unix Shell -- Current Directory -- Windows Directories -- Change Directory -- Useful Key Combinations -- Shell Version -- Data File -- File Contents -- Reverse File Contents -- My First Script -- Line Breaks -- Redirection Operator -- Installing Tools -- Permissions -- Debug -- Save Output -- Web Identifiers -- Single and Double Quotes -- Comments -- Data Retrieval -- Standard Error Output -- Data Extraction -- Single and Multiple Patterns -- Data Elements Selection -- Task Repetition -- Assembly Line -- File Header -- Variable -- XML Processing -- Human Proteins -- PubMed Identifiers -- PubMed Identifiers Extraction -- Duplicate Removal -- Complex Elements -- XPath -- Namespace Problems -- Only Local Names -- Queries -- Extracting XPath Results -- Text Retrieval -- Publication URL -- Title and Abstract -- Disease Recognition -- Further Reading -- 4 Text Processing -- Pattern Matching -- Case Insensitive Matching -- Number of Matches -- Invert Match -- File Differences -- Evaluation Metrics -- Word Matching -- Regular Expressions -- Extended Syntax -- Alternation -- Basic Syntax.
Scope -- Multiple Alternatives -- Multiple Characters -- Spaces -- Groups -- Ranges -- Negation -- Quantifiers -- Optional -- Multiple and Optional -- Multiple and Compulsory -- All Options -- Position -- Beginning -- Ending -- Near the End -- Word in Between -- Full Line -- Match Position -- Tokenization -- Character Delimiters -- Wrong Tokens -- String Replacement -- Multi-character Delimiters -- Keep Delimiters -- Sentences File -- Entity Recognition -- Select the Sentence -- Pattern File -- Relation Extraction -- Multiple Filters -- Relation Type -- Remove Relation Types -- Further Reading -- 5 Semantic Processing -- Classes -- OWL Files -- Class Label -- Class Definition -- Related Classes -- URIs and Labels -- URI of a Label -- Label of a URI -- Synonyms -- URI of Synonyms -- Parent Classes -- Labels of Parents -- Related Classes -- Labels of Related Classes -- Ancestors -- Grandparents -- Root Class -- Recursion -- Iteration -- My Lexicon -- Ancestors Labels -- Merging Labels -- Ancestors Matched -- Generic Lexicon -- All Labels -- Problematic Entries -- Special Characters Frequency -- Completeness -- Removing Special Characters -- Removing Extra Terms -- Removing Extra Spaces -- Disease Recognition -- Performance -- Inverted Recognition -- Case Insensitive -- ASCII Encoding -- Correct Matches -- Incorrect Matches -- Entity Linking -- Modified Labels -- Ambiguity -- Surrounding Entities -- Semantic Similarity -- Measures -- DiShIn Installation -- Database File -- DiShIn Execution -- Large Lexicons -- MER Installation -- Lexicon Files -- MER Execution -- Further Reading -- Bibliography -- Index.
Description based on publisher supplied metadata and other sources.
Electronic reproduction. Ann Arbor, Michigan : ProQuest Ebook Central, 2024. Available via World Wide Web. Access may be limited to ProQuest Ebook Central affiliated libraries.
Electronic books.
Print version: Couto, Francisco M. Data and Text Processing for Health and Life Sciences Cham : Springer International Publishing AG,c2019 9783030138448
ProQuest (Firm)
Advances in Experimental Medicine and Biology Series
https://ebookcentral.proquest.com/lib/oeawat/detail.action?docID=5788423 Click to View
language English
format eBook
author Couto, Francisco M.
spellingShingle Couto, Francisco M.
Data and Text Processing for Health and Life Sciences.
Advances in Experimental Medicine and Biology Series ;
Intro -- Preface -- Acknowledgments -- Contents -- Acronyms -- 1 Introduction -- Biomedical Data Repositories -- Scientific Text -- Amount of Text -- Ambiguity and Contextualization -- Biomedical Ontologies -- Programming Skills -- Why This Book? -- Third-Party Solutions -- Simple Pipelines -- How This Book Helps Health and Life Specialists? -- Shell Scripting -- Text Files -- Relational Databases -- What Is in the Book? -- Command Line Tools -- Pipelines -- Regular Expressions -- Semantics -- 2 Resources -- Biomedical Text -- What? -- Where? -- How? -- Semantics -- What? -- Languages -- Formality -- Gold Related Documents -- Where? -- OBO Ontologies -- Popular Controlled Vocabularies -- How? -- OWL -- URI -- Further Reading -- 3 Data Retrieval -- Caffeine Example -- Unix Shell -- Current Directory -- Windows Directories -- Change Directory -- Useful Key Combinations -- Shell Version -- Data File -- File Contents -- Reverse File Contents -- My First Script -- Line Breaks -- Redirection Operator -- Installing Tools -- Permissions -- Debug -- Save Output -- Web Identifiers -- Single and Double Quotes -- Comments -- Data Retrieval -- Standard Error Output -- Data Extraction -- Single and Multiple Patterns -- Data Elements Selection -- Task Repetition -- Assembly Line -- File Header -- Variable -- XML Processing -- Human Proteins -- PubMed Identifiers -- PubMed Identifiers Extraction -- Duplicate Removal -- Complex Elements -- XPath -- Namespace Problems -- Only Local Names -- Queries -- Extracting XPath Results -- Text Retrieval -- Publication URL -- Title and Abstract -- Disease Recognition -- Further Reading -- 4 Text Processing -- Pattern Matching -- Case Insensitive Matching -- Number of Matches -- Invert Match -- File Differences -- Evaluation Metrics -- Word Matching -- Regular Expressions -- Extended Syntax -- Alternation -- Basic Syntax.
Scope -- Multiple Alternatives -- Multiple Characters -- Spaces -- Groups -- Ranges -- Negation -- Quantifiers -- Optional -- Multiple and Optional -- Multiple and Compulsory -- All Options -- Position -- Beginning -- Ending -- Near the End -- Word in Between -- Full Line -- Match Position -- Tokenization -- Character Delimiters -- Wrong Tokens -- String Replacement -- Multi-character Delimiters -- Keep Delimiters -- Sentences File -- Entity Recognition -- Select the Sentence -- Pattern File -- Relation Extraction -- Multiple Filters -- Relation Type -- Remove Relation Types -- Further Reading -- 5 Semantic Processing -- Classes -- OWL Files -- Class Label -- Class Definition -- Related Classes -- URIs and Labels -- URI of a Label -- Label of a URI -- Synonyms -- URI of Synonyms -- Parent Classes -- Labels of Parents -- Related Classes -- Labels of Related Classes -- Ancestors -- Grandparents -- Root Class -- Recursion -- Iteration -- My Lexicon -- Ancestors Labels -- Merging Labels -- Ancestors Matched -- Generic Lexicon -- All Labels -- Problematic Entries -- Special Characters Frequency -- Completeness -- Removing Special Characters -- Removing Extra Terms -- Removing Extra Spaces -- Disease Recognition -- Performance -- Inverted Recognition -- Case Insensitive -- ASCII Encoding -- Correct Matches -- Incorrect Matches -- Entity Linking -- Modified Labels -- Ambiguity -- Surrounding Entities -- Semantic Similarity -- Measures -- DiShIn Installation -- Database File -- DiShIn Execution -- Large Lexicons -- MER Installation -- Lexicon Files -- MER Execution -- Further Reading -- Bibliography -- Index.
author_facet Couto, Francisco M.
author_variant f m c fm fmc
author_sort Couto, Francisco M.
title Data and Text Processing for Health and Life Sciences.
title_full Data and Text Processing for Health and Life Sciences.
title_fullStr Data and Text Processing for Health and Life Sciences.
title_full_unstemmed Data and Text Processing for Health and Life Sciences.
title_auth Data and Text Processing for Health and Life Sciences.
title_new Data and Text Processing for Health and Life Sciences.
title_sort data and text processing for health and life sciences.
series Advances in Experimental Medicine and Biology Series ;
series2 Advances in Experimental Medicine and Biology Series ;
publisher Springer International Publishing AG,
publishDate 2019
physical 1 online resource (107 pages)
edition 1st ed.
contents Intro -- Preface -- Acknowledgments -- Contents -- Acronyms -- 1 Introduction -- Biomedical Data Repositories -- Scientific Text -- Amount of Text -- Ambiguity and Contextualization -- Biomedical Ontologies -- Programming Skills -- Why This Book? -- Third-Party Solutions -- Simple Pipelines -- How This Book Helps Health and Life Specialists? -- Shell Scripting -- Text Files -- Relational Databases -- What Is in the Book? -- Command Line Tools -- Pipelines -- Regular Expressions -- Semantics -- 2 Resources -- Biomedical Text -- What? -- Where? -- How? -- Semantics -- What? -- Languages -- Formality -- Gold Related Documents -- Where? -- OBO Ontologies -- Popular Controlled Vocabularies -- How? -- OWL -- URI -- Further Reading -- 3 Data Retrieval -- Caffeine Example -- Unix Shell -- Current Directory -- Windows Directories -- Change Directory -- Useful Key Combinations -- Shell Version -- Data File -- File Contents -- Reverse File Contents -- My First Script -- Line Breaks -- Redirection Operator -- Installing Tools -- Permissions -- Debug -- Save Output -- Web Identifiers -- Single and Double Quotes -- Comments -- Data Retrieval -- Standard Error Output -- Data Extraction -- Single and Multiple Patterns -- Data Elements Selection -- Task Repetition -- Assembly Line -- File Header -- Variable -- XML Processing -- Human Proteins -- PubMed Identifiers -- PubMed Identifiers Extraction -- Duplicate Removal -- Complex Elements -- XPath -- Namespace Problems -- Only Local Names -- Queries -- Extracting XPath Results -- Text Retrieval -- Publication URL -- Title and Abstract -- Disease Recognition -- Further Reading -- 4 Text Processing -- Pattern Matching -- Case Insensitive Matching -- Number of Matches -- Invert Match -- File Differences -- Evaluation Metrics -- Word Matching -- Regular Expressions -- Extended Syntax -- Alternation -- Basic Syntax.
Scope -- Multiple Alternatives -- Multiple Characters -- Spaces -- Groups -- Ranges -- Negation -- Quantifiers -- Optional -- Multiple and Optional -- Multiple and Compulsory -- All Options -- Position -- Beginning -- Ending -- Near the End -- Word in Between -- Full Line -- Match Position -- Tokenization -- Character Delimiters -- Wrong Tokens -- String Replacement -- Multi-character Delimiters -- Keep Delimiters -- Sentences File -- Entity Recognition -- Select the Sentence -- Pattern File -- Relation Extraction -- Multiple Filters -- Relation Type -- Remove Relation Types -- Further Reading -- 5 Semantic Processing -- Classes -- OWL Files -- Class Label -- Class Definition -- Related Classes -- URIs and Labels -- URI of a Label -- Label of a URI -- Synonyms -- URI of Synonyms -- Parent Classes -- Labels of Parents -- Related Classes -- Labels of Related Classes -- Ancestors -- Grandparents -- Root Class -- Recursion -- Iteration -- My Lexicon -- Ancestors Labels -- Merging Labels -- Ancestors Matched -- Generic Lexicon -- All Labels -- Problematic Entries -- Special Characters Frequency -- Completeness -- Removing Special Characters -- Removing Extra Terms -- Removing Extra Spaces -- Disease Recognition -- Performance -- Inverted Recognition -- Case Insensitive -- ASCII Encoding -- Correct Matches -- Incorrect Matches -- Entity Linking -- Modified Labels -- Ambiguity -- Surrounding Entities -- Semantic Similarity -- Measures -- DiShIn Installation -- Database File -- DiShIn Execution -- Large Lexicons -- MER Installation -- Lexicon Files -- MER Execution -- Further Reading -- Bibliography -- Index.
isbn 9783030138455
9783030138448
callnumber-first R - Medicine
callnumber-subject RC - Internal Medicine
callnumber-label RC261-271
callnumber-sort RC 3261 3271
genre Electronic books.
genre_facet Electronic books.
url https://ebookcentral.proquest.com/lib/oeawat/detail.action?docID=5788423
illustrated Not Illustrated
oclc_num 1106161463
work_keys_str_mv AT coutofranciscom dataandtextprocessingforhealthandlifesciences
status_str n
ids_txt_mv (MiAaPQ)5005788423
(Au-PeEL)EBL5788423
(OCoLC)1106161463
carrierType_str_mv cr
hierarchy_parent_title Advances in Experimental Medicine and Biology Series ; v.1137
is_hierarchy_title Data and Text Processing for Health and Life Sciences.
container_title Advances in Experimental Medicine and Biology Series ; v.1137
marc_error Info : Unimarc and ISO-8859-1 translations identical, choosing ISO-8859-1. --- [ 856 : z ]
_version_ 1792331056206053376
fullrecord <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>05168nam a22004093i 4500</leader><controlfield tag="001">5005788423</controlfield><controlfield tag="003">MiAaPQ</controlfield><controlfield tag="005">20240229073832.0</controlfield><controlfield tag="006">m o d | </controlfield><controlfield tag="007">cr cnu||||||||</controlfield><controlfield tag="008">240229s2019 xx o ||||0 eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783030138455</subfield><subfield code="q">(electronic bk.)</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9783030138448</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(MiAaPQ)5005788423</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(Au-PeEL)EBL5788423</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1106161463</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">MiAaPQ</subfield><subfield code="b">eng</subfield><subfield code="e">rda</subfield><subfield code="e">pn</subfield><subfield code="c">MiAaPQ</subfield><subfield code="d">MiAaPQ</subfield></datafield><datafield tag="050" ind1=" " ind2="4"><subfield code="a">RC261-271</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Couto, Francisco M.</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Data and Text Processing for Health and Life Sciences.</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">1st ed.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Cham :</subfield><subfield code="b">Springer International Publishing AG,</subfield><subfield code="c">2019.</subfield></datafield><datafield tag="264" ind1=" " ind2="4"><subfield code="c">Ã2019.</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 online resource (107 pages)</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">computer</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">online resource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Advances in Experimental Medicine and Biology Series ;</subfield><subfield code="v">v.1137</subfield></datafield><datafield tag="505" ind1="0" ind2=" "><subfield code="a">Intro -- Preface -- Acknowledgments -- Contents -- Acronyms -- 1 Introduction -- Biomedical Data Repositories -- Scientific Text -- Amount of Text -- Ambiguity and Contextualization -- Biomedical Ontologies -- Programming Skills -- Why This Book? -- Third-Party Solutions -- Simple Pipelines -- How This Book Helps Health and Life Specialists? -- Shell Scripting -- Text Files -- Relational Databases -- What Is in the Book? -- Command Line Tools -- Pipelines -- Regular Expressions -- Semantics -- 2 Resources -- Biomedical Text -- What? -- Where? -- How? -- Semantics -- What? -- Languages -- Formality -- Gold Related Documents -- Where? -- OBO Ontologies -- Popular Controlled Vocabularies -- How? -- OWL -- URI -- Further Reading -- 3 Data Retrieval -- Caffeine Example -- Unix Shell -- Current Directory -- Windows Directories -- Change Directory -- Useful Key Combinations -- Shell Version -- Data File -- File Contents -- Reverse File Contents -- My First Script -- Line Breaks -- Redirection Operator -- Installing Tools -- Permissions -- Debug -- Save Output -- Web Identifiers -- Single and Double Quotes -- Comments -- Data Retrieval -- Standard Error Output -- Data Extraction -- Single and Multiple Patterns -- Data Elements Selection -- Task Repetition -- Assembly Line -- File Header -- Variable -- XML Processing -- Human Proteins -- PubMed Identifiers -- PubMed Identifiers Extraction -- Duplicate Removal -- Complex Elements -- XPath -- Namespace Problems -- Only Local Names -- Queries -- Extracting XPath Results -- Text Retrieval -- Publication URL -- Title and Abstract -- Disease Recognition -- Further Reading -- 4 Text Processing -- Pattern Matching -- Case Insensitive Matching -- Number of Matches -- Invert Match -- File Differences -- Evaluation Metrics -- Word Matching -- Regular Expressions -- Extended Syntax -- Alternation -- Basic Syntax.</subfield></datafield><datafield tag="505" ind1="8" ind2=" "><subfield code="a">Scope -- Multiple Alternatives -- Multiple Characters -- Spaces -- Groups -- Ranges -- Negation -- Quantifiers -- Optional -- Multiple and Optional -- Multiple and Compulsory -- All Options -- Position -- Beginning -- Ending -- Near the End -- Word in Between -- Full Line -- Match Position -- Tokenization -- Character Delimiters -- Wrong Tokens -- String Replacement -- Multi-character Delimiters -- Keep Delimiters -- Sentences File -- Entity Recognition -- Select the Sentence -- Pattern File -- Relation Extraction -- Multiple Filters -- Relation Type -- Remove Relation Types -- Further Reading -- 5 Semantic Processing -- Classes -- OWL Files -- Class Label -- Class Definition -- Related Classes -- URIs and Labels -- URI of a Label -- Label of a URI -- Synonyms -- URI of Synonyms -- Parent Classes -- Labels of Parents -- Related Classes -- Labels of Related Classes -- Ancestors -- Grandparents -- Root Class -- Recursion -- Iteration -- My Lexicon -- Ancestors Labels -- Merging Labels -- Ancestors Matched -- Generic Lexicon -- All Labels -- Problematic Entries -- Special Characters Frequency -- Completeness -- Removing Special Characters -- Removing Extra Terms -- Removing Extra Spaces -- Disease Recognition -- Performance -- Inverted Recognition -- Case Insensitive -- ASCII Encoding -- Correct Matches -- Incorrect Matches -- Entity Linking -- Modified Labels -- Ambiguity -- Surrounding Entities -- Semantic Similarity -- Measures -- DiShIn Installation -- Database File -- DiShIn Execution -- Large Lexicons -- MER Installation -- Lexicon Files -- MER Execution -- Further Reading -- Bibliography -- Index.</subfield></datafield><datafield tag="588" ind1=" " ind2=" "><subfield code="a">Description based on publisher supplied metadata and other sources.</subfield></datafield><datafield tag="590" ind1=" " ind2=" "><subfield code="a">Electronic reproduction. Ann Arbor, Michigan : ProQuest Ebook Central, 2024. Available via World Wide Web. Access may be limited to ProQuest Ebook Central affiliated libraries. </subfield></datafield><datafield tag="655" ind1=" " ind2="4"><subfield code="a">Electronic books.</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Print version:</subfield><subfield code="a">Couto, Francisco M.</subfield><subfield code="t">Data and Text Processing for Health and Life Sciences</subfield><subfield code="d">Cham : Springer International Publishing AG,c2019</subfield><subfield code="z">9783030138448</subfield></datafield><datafield tag="797" ind1="2" ind2=" "><subfield code="a">ProQuest (Firm)</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Advances in Experimental Medicine and Biology Series</subfield></datafield><datafield tag="856" ind1="4" ind2="0"><subfield code="u">https://ebookcentral.proquest.com/lib/oeawat/detail.action?docID=5788423</subfield><subfield code="z">Click to View</subfield></datafield></record></collection>