Data and Text Processing for Health and Life Sciences.

Bibliographic Details
Superior document:Advances in Experimental Medicine and Biology Series ; v.1137
Place / Publishing House:Cham : Springer International Publishing AG, 2019.
©2019.
Year of Publication:2019
Edition:1st ed.
Language:English
Series:Advances in Experimental Medicine and Biology Series
Physical Description:1 online resource (107 pages)
LEADER 05168nam a22004093i 4500
001 5005788423
003 MiAaPQ
005 20240229073832.0
006 m o d |
007 cr cnu||||||||
008 240229s2019 xx o ||||0 eng d
020 |a 9783030138455  |q (electronic bk.) 
020 |z 9783030138448 
035 |a (MiAaPQ)5005788423 
035 |a (Au-PeEL)EBL5788423 
035 |a (OCoLC)1106161463 
040 |a MiAaPQ  |b eng  |e rda  |e pn  |c MiAaPQ  |d MiAaPQ 
050 4 |a RC261-271 
100 1 |a Couto, Francisco M. 
245 1 0 |a Data and Text Processing for Health and Life Sciences. 
250 |a 1st ed. 
264 1 |a Cham :  |b Springer International Publishing AG,  |c 2019. 
264 4 |c ©2019. 
300 |a 1 online resource (107 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
490 1 |a Advances in Experimental Medicine and Biology Series ;  |v v.1137 
505 0 |a Intro -- Preface -- Acknowledgments -- Contents -- Acronyms -- 1 Introduction -- Biomedical Data Repositories -- Scientific Text -- Amount of Text -- Ambiguity and Contextualization -- Biomedical Ontologies -- Programming Skills -- Why This Book? -- Third-Party Solutions -- Simple Pipelines -- How This Book Helps Health and Life Specialists? -- Shell Scripting -- Text Files -- Relational Databases -- What Is in the Book? -- Command Line Tools -- Pipelines -- Regular Expressions -- Semantics -- 2 Resources -- Biomedical Text -- What? -- Where? -- How? -- Semantics -- What? -- Languages -- Formality -- Gold Related Documents -- Where? -- OBO Ontologies -- Popular Controlled Vocabularies -- How? -- OWL -- URI -- Further Reading -- 3 Data Retrieval -- Caffeine Example -- Unix Shell -- Current Directory -- Windows Directories -- Change Directory -- Useful Key Combinations -- Shell Version -- Data File -- File Contents -- Reverse File Contents -- My First Script -- Line Breaks -- Redirection Operator -- Installing Tools -- Permissions -- Debug -- Save Output -- Web Identifiers -- Single and Double Quotes -- Comments -- Data Retrieval -- Standard Error Output -- Data Extraction -- Single and Multiple Patterns -- Data Elements Selection -- Task Repetition -- Assembly Line -- File Header -- Variable -- XML Processing -- Human Proteins -- PubMed Identifiers -- PubMed Identifiers Extraction -- Duplicate Removal -- Complex Elements -- XPath -- Namespace Problems -- Only Local Names -- Queries -- Extracting XPath Results -- Text Retrieval -- Publication URL -- Title and Abstract -- Disease Recognition -- Further Reading -- 4 Text Processing -- Pattern Matching -- Case Insensitive Matching -- Number of Matches -- Invert Match -- File Differences -- Evaluation Metrics -- Word Matching -- Regular Expressions -- Extended Syntax -- Alternation -- Basic Syntax. 
505 8 |a Scope -- Multiple Alternatives -- Multiple Characters -- Spaces -- Groups -- Ranges -- Negation -- Quantifiers -- Optional -- Multiple and Optional -- Multiple and Compulsory -- All Options -- Position -- Beginning -- Ending -- Near the End -- Word in Between -- Full Line -- Match Position -- Tokenization -- Character Delimiters -- Wrong Tokens -- String Replacement -- Multi-character Delimiters -- Keep Delimiters -- Sentences File -- Entity Recognition -- Select the Sentence -- Pattern File -- Relation Extraction -- Multiple Filters -- Relation Type -- Remove Relation Types -- Further Reading -- 5 Semantic Processing -- Classes -- OWL Files -- Class Label -- Class Definition -- Related Classes -- URIs and Labels -- URI of a Label -- Label of a URI -- Synonyms -- URI of Synonyms -- Parent Classes -- Labels of Parents -- Related Classes -- Labels of Related Classes -- Ancestors -- Grandparents -- Root Class -- Recursion -- Iteration -- My Lexicon -- Ancestors Labels -- Merging Labels -- Ancestors Matched -- Generic Lexicon -- All Labels -- Problematic Entries -- Special Characters Frequency -- Completeness -- Removing Special Characters -- Removing Extra Terms -- Removing Extra Spaces -- Disease Recognition -- Performance -- Inverted Recognition -- Case Insensitive -- ASCII Encoding -- Correct Matches -- Incorrect Matches -- Entity Linking -- Modified Labels -- Ambiguity -- Surrounding Entities -- Semantic Similarity -- Measures -- DiShIn Installation -- Database File -- DiShIn Execution -- Large Lexicons -- MER Installation -- Lexicon Files -- MER Execution -- Further Reading -- Bibliography -- Index. 
588 |a Description based on publisher supplied metadata and other sources. 
590 |a Electronic reproduction. Ann Arbor, Michigan : ProQuest Ebook Central, 2024. Available via World Wide Web. Access may be limited to ProQuest Ebook Central affiliated libraries.  
655 4 |a Electronic books. 
776 0 8 |i Print version:  |a Couto, Francisco M.  |t Data and Text Processing for Health and Life Sciences  |d Cham : Springer International Publishing AG,c2019  |z 9783030138448 
797 2 |a ProQuest (Firm) 
830 0 |a Advances in Experimental Medicine and Biology Series 
856 4 0 |u https://ebookcentral.proquest.com/lib/oeawat/detail.action?docID=5788423  |z Click to View