Biomedical Health Informatics Graduate Program

Case Western Reserve University

Our Research

Our research is focused on developing new data and metadata representation and analysis techniques for biomedical and healthcare research. Our current research focuses on integrative analysis of brain connectivity data for characterizing spatiotemporal characteristics of epilepsy seizure networks using computational neuroscience approaches. To address the challenges of data quality and scientific reproducibility in data-driven biomedical research we are also developing a provenance metadata framework using provenance ontology and text mining of published articles. Our interdisciplinary research involves close collaboration with clinical, biostatistics, and high performance networking researchers.

Research Interests

Epilepsy seizure networks; Structural connectivity networks derived from MRI; Functional connectivity networks derived from EEG; Provenance metadata; Ontology engineering; Data integration; High performance computing

Brain Connectivity in Neurological Disorders

We study underlying mechanisms that influence the generation and progression of abnormal electrophysiological signals in epilepsy, which is a serious neurological disorder affecting more than 50 million individuals worldwide with debilitating seizures. Our research uses high resolution signal data recorded using intracranial EEG with multiple contacts. However, this approach involves querying and analyzing large volume of multi-modal data. To address this challenge, we incorporate techniques of Big Data analytics, including the development of new data models that are compatible with techniques of large-scale data analysis, such as parallel and distributed computing. We have developed flexible analysis workflows with multiple measures of statistical correlation that can quantitatively assess the strength of the connections among the brain regions active during a seizure event. More information is available on the project page. NIC Workflow Website

Provenance Metadata for Scientific Reproducibility

Scientific reproducibility is key to scientific progress as it allows the research community to build on validated results, protect patients from potentially harmful trial drugs derived from incorrect results, and reduce wastage of valuable resources. To address this challenge in the biomedical research domain, we are developing the Provenance for Clinical and Healthcare Research (ProvCaRe) framework using World Wide Web Consortium (W3C) PROV specifications, including the PROV Ontology (PROV-O). In the ProvCaRe project, we are extending PROV-O to create a formal model of provenance information that is necessary for scientific reproducibility in biomedical research. ProvCaRe framework aims to model, extract, and analyze provenance information. The ProvCaRe framework consists of the S3 Model that extends the PROV specifications to model provenance metadata describing Study Method, Study Tools, Study Data in a research study. We have developed a provenance-specific text processing pipeline that uses the ProvCaRe ontology to identify and extract provenance metadata from published literature describing biomedical research studies. The ProveCaRe knowledge repository contains provenance "triples" extracted from published research studies that can be queried and explored by users using "hypothesis-based search". ProvCaRe Website

Insight: Data Integration and Cohort Studies

Insight is a Semantic Web technology-based platform to support large-scale secondary analysis of healthcare data for neurology clinical research. Insight features the novel use of: (1) provenance metadata, which describes the history or origin of patient data, in clinical research analysis, and (2) support for patient cohort queries across multiple institutions conducting research in epilepsy, and (3) interactive user interface with data exploration features. Insight is being developed as a healthcare informatics infrastructure to support a national network of eight epilepsy research centers across the U.S. funded by the U.S. Centers for Disease Control and Prevention (CDC). Insight uses a set of Common Data Elements (CDE) developed by the Managing Epilepsy Well (MEW) research network members and the Epilepsy and Seizure Ontology (EpSO) for reconciling data heterogeneity and supporting clinical research queries. At present, Insight stores data from 400 participants representing five completed epilepsy research studies.



Yang S, Ghosh K, Sakaie K, Sahoo SS, Carr S, Tatsuoka C, A Simplified Crossing Fiber Model in Diffusion Weighted Imaging, Frontiers in Neuroscience (accepted)

Sahoo SS, Valdez J, Rueschman M, Kim M, Semantic Provenance Graph for Reproducibility of Biomedical Research Studies: Generating and Analyzing Graph Structures from Published Literature, International Medical Informatics Association (IMIA), MedInfo 2019 conference (accepted)

Socrates V, Gershon A, Sahoo SS, Computation of Brain Functional Connectivity Network Measures in Epilepsy: A Web-based Platform for EEG Signal Data Processing and Analysis, International Medical Informatics Association (IMIA), MedInfo 2019 conference (poster) (accepted)

Sahoo SS, Valdez J, Kim M, Rueschman M, Redline S, ProvCaRe: Characterizing scientific reproducibility of biomedical research studies using semantic provenance metadata. International Journal of Medical Informatics, 121, pp.10-18., 2019

Gershon A, Devulapalli P, Zonjy B, Ghosh K, Tatsuoka C, Sahoo SS, Computing Functional Brain Connectivity in Neurological Disorders: Efficient Processing and Retrieval of Electrophysiological Signal Data, AMIA Joint Summits 2019, pp: 107-116

Valdez J, Kim M, Rueschman M, Redline S, Sahoo SS, Classification of Provenance Triples for Scientific Reproducibility: A Comparative Evaluation of Deep Learning Models in the ProvCaRe Project, International Provenance Annotation Workshop (IPAW) 2018 Proceedings, Springer, pp 30-41

Belhajjame K, Garijo D, Sahoo SS, Semantic Web and Provenance for Scientific Reproducibility (SPSR), International Semantic Web Conference 2019 (Tutorial)


Valdez J, Rueschman M, Kim M, Arabyarmohammadi S, Redline S, Sahoo SS, An Extensible Ontology Modeling Approach Using Post Coordinated Expressions for Semantic Provenance in Biomedical Research, The 16th International Conference on. Ontologies, DataBases, and Applications of Semantics (ODBASE), Rhodes, Greece, 2017. pp. 337-352

Valdez J, Kim M, Rueschman M, Socrates V, Redline S, Sahoo SS, ProvCaRe Semantic Provenance Knowledgebase: Evaluating Scientific Reproducibility of Research Studies, American Medical Informatics Association (AMIA) Annual Symposium, 2017, pp. 1688 – 1697 (Finalist for Distinguished Paper Award)

Sajatovic M, Tatsuoka C, Welter E, Friedman D, Spruill TM, Stoll S, Sahoo SS, Bukach A, Bamps YA, Valdez J, Jobst BC. Correlates of quality of life among individuals with epilepsy enrolled in self-management research: From the US Centers for Disease Control and Prevention Managing Epilepsy Well Network. Epilepsy Behavior. 2017 Jan 27. pii: S1525-5050(16)30742-9. PMID: 28139451

Gershon AL, Zonjy B, Tatsuoka C, Ghosh K, Lhatoo SD, Sahoo SS, A Flexible Computational Neuroinformatics Workflow for Computing Functional Networks in Epilepsy Neurological Disorder, American Medical Informatics Association (AMIA) Annual Symposium, Washington DC, 2017 (Abstract)

Gershon AL, Lhatoo SD, Tatsuoka C, Ghosh K, Loparo K, Sahoo SS, Scalable Signal Data Processing for Measuring Functional Connectivity in Epilepsy Neurological Disorder, Biomedical Signal Processing in Big Data, Ervin Sejdic, Tiago Falk (Eds), 2017 (in press)


Valdez J, Rueschman M, Kim M, Redline S, Sahoo SS. An Ontology-Enabled Natural Language Processing Pipeline for Provenance Metadata Extraction from Biomedical Text. 15th International Conference on Ontologies, DataBases, and Applications of Semantics (ODBASE) 2016: 699-708.

Sahoo SS, Ramesh P, Welter E, Bukach A, Valdez J, Tatsuoka C, Bamps Y, Stoll S, Jobst BC, Sajatovic M. Insight: An Ontology-based Integrated Database and Analysis Platform for Epilepsy Self-Management Research, International Journal of Medical Informatics, 2016. PMID: 27573308

Sahoo SS, Wei A, Valdez J, Wang L, Zonjy B, Tatsuoka C, Loparo KA, Lhatoo SD. NeuroPigPen: a Scalable Toolkit for Processing Electrophysiological Signal Data in Neuroscience Applications using Apache Pig, Frontiers in Neuroinformatics, 10:18. 2016. PMID: 27375472

Sahoo SS, Wei A, Tatsuoka C, Ghosh K, Lhatoo SD. Processing Neurology Clinical Data for Knowledge Discovery: Scalable Data Flows Using Distributed Computing, Book Chapter

Sahoo SS, Valdez J, Rueschman M. Scientific Reproducibility in Biomedical Research: Provenance Metadata Ontology for Semantic Annotation of Study Description, American Medical Informatics Association (AMIA) Annual Symposium, 2016:1070-1079 PMID: 28269904

Dean DA, Goldberger AL, Mueller R, Kim M, Rueschman M, Mobley D, Sahoo SS, Jayapandian C, Cui L, Morrical MG, Surovec S, Zhang GQ, Redline S. Scaling up Scientific Discovery in Sleep Medicine:The National Sleep Research Resource. 39(5): 1151-64. 2016. PMID: 27070134


Yang S, Tatsuoka C, Ghosh K, Lacuey-Lecumberri N, Lhatoo SD, Sahoo SS. Comparative Evaluation for Brain Structural Connectivity Approaches: Towards Integrative Neuroinformatics Tool for Epilepsy Clinical Research. AMIA 2016 Joint Summits on Translational Science. (Nominated for the Best Student Paper Award).446-54. PMID: 27570685

Ramesh P, Wei A, Sams J, Welter E, Lhatoo S, Sajatovic M, Sahoo SS. Insight: Semantic Provenance and Analysis Platform for Multi-center Neurology Healthcare Research. Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2015:731-736. PMID: 27069752

Sahoo SS, Rueschman M, Valdez J, Hsu W, Lhatoo SD, Redline S. Provenance Analysis over Biomedical Big Data Using PROV: Towards Effective Secondary Data Analysis Across Multiple Studies. NIH Big Data to Knowledge (BD2K) Meeting, Bethesda MD. Nov 12-13, 2015 (Poster).

Sahoo SS, Rao P. Provenance Analysis and RDF Query Processing: W3C PROV for Data Quality and Trust. In the 14th International Semantic Web Conference (ISWC 2015), Bethlehem, PA, 2015. (Tutorial, to appear)

Jayapandian C, Wei A, Ramesh P, Zonjy B, Lhatoo SD, Loparo K, Zhang GQ, Sahoo SS. A Scalable Neuroinformatics Data Flow for Electrophysiological Signals using MapReduce. Frontiers in Neuroinformatics. 2015 9:4. PMID: 25852536

Sahoo SS, Zhang GQ, Bamps Y, Fraser R, Stoll S, Lhatoo SD, Tatsuoka C, Welter E, Sajatovic M. Managing Information Well: Toward an Ontology-driven Informatics Platform for Data Sharing and Secondary Use in Epilepsy Self-Management Research Centers. Health Informatics Journal, 2015.22(3):548-61. PMID: 25769938 

LaFrance Jr. WC, Ranieri R, Bamps Y, Stoll S, Sahoo SS, Welter E, Sams J, Tatsuoka C, Sajatovic M. Comparison of common data elements from the Managing Epilepsy Well (MEW) Network integrated database and a well-characterized sample with nonepileptic seizures Epilepsy & Behavior. 2015.45:136. PMID: 25825372


Jayapandian CP, Chen CH, Dabir A, Zhang GQ, Lhatoo SD, Sahoo SS. Domain Ontology As Conceptual Model for Big Data Management: Application in Biomedical Informatics,Proceedings of the 33rd International Conference on Conceptual Modeling (ER 2014) 2014. pp. 144-157

Zhang GQ, Cui L, Lhatoo, SD, Schuele SU, Sahoo SS. MEDCIS: Multi-Modality Epilepsy Data Capture and Integration System. American Medical Informatics Association (AMIA) Annual Symposium, 2014:1248-57. PMID: 25954436 

Sahoo SS, Tao S, Parchman A, Luo Z, Cui L, Mergler P, Lanese R, Barnholtz-Sloan JS, Meropol NJ, Zhang GQ. Trial Prospector: Matching Patients with Cancer Research Studies using an Automated and Scalable Approach. Journal of Cancer Informatics 2014. Dec 4;13:157-66. PMID: 25506198

Cui L, Sahoo SS, Lhatoo SD, Garg G, Rai P, Bozorgi A, Zhang GQ. Complex Epilepsy Phenotype Extraction from Narrative Clinical Discharge Summaries. Journal of Biomedical Informatics 2014 Oct;51:272-9. PMID: 24973735

Sahoo SS, McIntyre C, Lhatoo SD. A Match Made in Cloud? Meeting the Requirements of the Next Generation Neuroscience Research Using Configurable Cloud Infrastructure. National Science Foundation (NSF) Cloud Workshop, Dec 11-12, 2014 


Sahoo SS, Jayapandian C, Garg G, Kaffashi F, Chung S, Bozorgi A, Chen CH, Loparo K, Lhatoo SD, Zhang GQ. Heartbeats in the Cloud: Distributed Analysis of Electrophysiological “Big Data” using Cloud Computing for Epilepsy Clinical Research. Journal of American Medical Informatics Association JAMIA (special issue on Big Data in Healthcare and Biomedical Research) 2013. 21(2):263-71 PMID: 24326538 (Editor’s Choice Article Special Issue)

Jayapandian CP, Chen CH, Bozorgi A, Lhatoo SD, Zhang GQ, Sahoo SS. Cloudwave: Distributed Processing of “Big Data” from Electrophysiological Recordings for Epilepsy Clinical Research Using Hadoop. American Medical Informatics Association (AMIA) Annual Symposium, 2013. pp. 691-700 PMID: 24551370

Sahoo SS, Lhatoo SD, Gupta DK, Cui L, Zhao M, Jayapadian C, Bozorgi A, Zhang GQ. Epilepsy and Seizure Ontology: Towards an Epilepsy Informatics Infrastructure for Clinical Research and Patient Care. Journal of American Medical Informatics Association (JAMIA), 2013. EPub doi:10.1136/amiajnl-2013-001696 PMID: 23686934

Bozorgi A, Chung S, Kaffashi F, Loparo KA, Sahoo SS, Zhang GQ, Kaiboriboon K, Lhatoo SD. Significant postictal hypotension: expanding the spectrum of seizure-induced autonomic dysregulation. Epilepsia. 2013 Sep;54(9):e127-30. doi: 10.1111/epi.12251. Epub 2013 Jun 12. PMID: 23758665

Cui L, Mueller R, Sahoo SS, Zhang GQ. Querying Complex Federated Clinical Data Using Ontological Mapping and Subsumption Reasoning.  IEEE International Conference on Healthcare Informatics 2013 (ICHI 2013) pp. 351-360.

Lebo T, Sahoo SS, McGuinness D. (eds.) PROV-O: The PROV Ontology. 30 April 2013, W3C Recommendation.

Sahoo SS, Zhang GQ, Lhatoo SD. Epilepsy Informatics and an Ontology-driven Infrastructure for Large Database Research and Patient Care in Epilepsy. Review Paper, Epilepsia, 2013. 54(8). pp. 1335-41. PMID: 23647220 (Editor’s Choice Article: September 2013)

Jayapandian CP, Chen CH, Bozorgi A, Lhatoo SD, Zhang GQ, Sahoo SS. Electrophysiological Signal Analysis and Visualization using Cloudwave for Epilepsy Clinical Research. The 14th World Congress on Medical and Health Informatics (MedInfo), Stud Health Technol Inform. 2013. Vol. 192. pp.817-21. PMID: 23920671

Asiaee AH, Doshi P, Minning T, Sahoo SS, Parikh P, Sheth A, Tarleton RL. From Questions to Effective Answers: On the Utility of Knowledge-Driven Querying Systems for Life Sciences Data. The 9th International Conference on Data Integration in the Life Sciences (DILS), 2013. pp. 38-45.

Parchman AJ, Zhang GQ, Mergler P, Barnholtz-Sloan J, Lanese R, Miller DW, Opper C,Sahoo SS, Tao S, Teagno J, Warfe J, Meropol NJ. Trial prospector: An automated clinical trials eligibility matching program. Proceedings of the American Society of Clinical Oncology (ASCO) Annual Meeting. 2013.


Jayapandian C, Zhao M, Ewing R, Zhang GQ, Sahoo SS. A Semantic Proteomics Dashboard (SemPoD) for Data Management in Translational Research. BMC Systems Biology, Vol. 6(Suppl 3):S20, 2012. PMID: 23282161

Parikh PP, Zheng J, Logan-Klumper F, Stoeckert Jr. CJ, Louis C, Topalis P, Protasio AV, Sheth AP, Carrington M, Berriman M, Sahoo SS. The Ontology for Parasite Lifecycle (OPL): Towards a Consistent Vocabulary of Lifecycle Stages in Parasitic Organisms. Journal Biomedical Semantics (JBMS), 2012. Vol. 23; 3(1): 5. PMID: 22621763

Zhang GQ, Sahoo SS, Lhatoo SD. From Classification to Epilepsy Ontology and Informatics. Epilepsia, 2012. Vol. 53(Suppl. 2). pp. 28-32. PMID: 22765502

Parikh PP, Minning TA, Nguyen V, Lalithsena S, Asiaee AH, Sahoo SS, Doshi P, Tarleton R, Sheth AP. A Semantic Problem Solving Environment for Integrative Parasite Research: Identification of Intervention Targets for Trypanosoma cruzi. PLoS Neglected Tropical Diseases, 2012. Vol. 6(1): e1458. PMID: 22272365

S.S. Sahoo, M. Zhao, L. Luo, A. Bozorgi, D. Gupta, S.D Lhatoo, GQ Zhang, “OPIC: Ontology-driven Patient Information Capturing System for Epilepsy.” Proceedings of the American Medical Informatics Association (AMIA) Annual Symposium, Chicago, IL, pp. 799-808, Nov 2012. PMID: 23304354

Cui L, Bozorgi A, Lhatoo SD, Zhang GQ, Sahoo SS.  EpiDEA: Extracting Structured Epilepsy and Seizure Information from Patient Discharge Summaries for Cohort Identification. American Medical Informatics Association (AMIA) Annual Symposium, 2012. pp. 1191-1200. PMID: 23304396

Zhang GQ, Luo L, Ogbuji C, Joslyn C, Mejino J, Sahoo SS.  An Analysis of Multi-type Relational Interactions in FMA Using Graph Motifs. American Medical Informatics Association (AMIA) Annual Symposium, 2012. pp. 1060-1069. PMID: 23304382

Teagno J, Kiefer RC, Pathak J, Zhang GQ, Sahoo SS.  A Distributed Semantic Web Approach for Cohort Identification. Proceedings of the American Medical Informatics Association (AMIA) Annual Symposium, 2012; pp. 1969

Jayapandian C, Ewing R, Zhang GQ, Sahoo SS.  A Semantic Proteomics Dashboard (SemPoD) for Proteomics Data Management in Translational Research. AMIA Clinical Research Informatics Summit (CRI), 2012. PMID: 23282161


Sahoo SS, Nguyen V, Bodenreider O, Parikh PP, Minning T, Sheth AP. A unified framework for managing provenance information in translational research. BMC Bioinformatics, 2011. Vol. 12:461. PMID: 22126369

Zhao J, Sahoo SS, Missier P, Sheth AP, Goble C. Extending Semantic Provenance into the Web of Data. IEEE Internet Computing, 2011. Vol. 15(1). pp. 40-48.

Sahoo SS, Ogbuji C, Luo L, Dong X, Cui L, Redline SS, Zhang GQ. MiDas: Automatic Extraction of a Common Domain of Discourse in Sleep Medicine for Multi-center Data Integration. American Medical Informatics Association (AMIA) Annual Symposium, 2011. pp. 1196-1205. PMID: 22195180

Sahoo SSTowards Desiderata for Provenance Ontologies in Biomedicine, International Conference on Biomedical Ontologies (ICBO), 2011. pp. 269-272.

Mueller R, Sahoo SS, Dong X, Redline S, Arabandi S, Luo L, Zhang GQ. Mapping multi-institution data sources to domain ontology for data federation: the PhysioMIMI approach. AMIA Clinical Research Informatics Summit (CRI), 2011.

Zhang GQ, Mueller R, Jonhson N, Arabandi S, Sahoo SS, Redline S. Online Exploration of Case-control Study Designs in VISAGE. AMIA Clinical Research Informatics Summit (CRI), 2011.


Barga R, Simmhan Y, Chinthaka-Withana E, Sahoo SS, Jackson J, Araujo N. Provenance for Scientific Workflows Towards Reproducible Research. IEEE Data Engineering Bulletin, 2010. Vol. 33(3). pp. 50-58.

Sahoo SS, Bodenreider O, Hitzler P, Sheth AP, Thirunarayan K. Provenance Context Entity (PaCE): Scalable provenance tracking for scientific RDF data. The 22nd International Conference on Scientific and Statistical Database Management (SSDBM), 2010. pp. 461-470. PMID: 25621321

Missier P, Sahoo SS, Zhao J, Goble C, Sheth A. Janus: from workflows to semantic provenance and linked open data. The 3rd International Provenance and Annotation Workshop (IPAW), Lecture Notes in Computer Science, Vol. 6378/2010, 2010. pp. 129-141.

Deus H, Zhao J, Sahoo SS, Samwald M, Prud’hommeaux E, Miller M, Marshall MS, Cheung K. Provenance of Microarray Experiments for a Better Understanding of Experiment Results. The 2nd International Workshop on Role of Semantic Web in Provenance Management (SWPM 2010), co-located with ISWC, 2010.

Patni H, Sahoo SS, Henson C, Sheth A. Provenance Aware Linked Sensor Data, The 2nd International Workshop on Trust and Privacy on the Social and Semantic Web, co-located with ESWC, 2010.

Sahoo SS, Groth P, Hartig O, Miles S, Coppens S, Myers J, Gil Y, Moreau L, Zhao J, Panzer M, Garijo D. Provenance Vocabulary Mappings. W3C Provenance Incubator Group Report, 2010.


Sahoo SS, Weatherly DB, Mutharaju R, Anantharam P, Sheth AP, Tarleton RL. Ontology-driven Provenance Management in eScience: an Application in Parasite Research. The 8th International Conference on Ontologies, DataBases, and Applications of Semantics, (ODBASE), 2009. pp. 992-1009.

Sahoo SS, Sheth A. Provenir ontology: Towards a Framework for eScience Provenance Management. Microsoft eScience Workshop, 2009.

Sahoo SS, Halb W, Hellmann S, Idehen K, Thibodeau Jr. T, Auer S, Sequeda J, Ezzat A.  A Survey of Current Approaches for Mapping of Relational Databases to RDF. W3C RDB2RDF Incubator Group Report, 2009.


Sahoo SS, Sheth AP, Henson C.  Semantic Provenance for eScience: ‘Meaningful’ Metadata to Manage the Deluge of Scientific Data. IEEE Internet Computing, Web-Scale Workflow Track, M.B. Blake and M. Huhns (Eds.), 2008. Vol. 12(4). pp.46-54. (Featured in Association of Computing Machinery (ACM) TechNews 2008)

Sahoo SS, Bodenreider O, Rutter JL, Skinner KJ, Sheth AP. An ontology-driven semantic mash-up of gene and biological pathway information: Application to the domain of nicotine dependence. Journal of Biomedical Informatics (Special Issue: Semantic Mashup of Biomedical Data), 2008. Vol. 41(5). pp. 752-65. PMID: 18395495

Sheth A, Henson C, Sahoo SS. Semantic Sensor Web. IEEE Internet Computing, 2008. Vol. 12(4). pp. 78-83.

Valerio MD, Sahoo SS, Barga RS, Jackson JJ.  Capturing Workflow Event Data for Monitoring, Performance Analysis, and Management of Scientific Workflows. SWBES08, co-located with the 4th IEEE International Conference on eScience, 2008. pp. 626-33.


Sahoo SS, Zeng K, Bodenreider O, Sheth AP.  From ‘glycosyltransferase’ to ‘congenital muscular dystrophy’: Integrating knowledge from NCBI Entrez Gene and the Gene Ontology. The 12th World Congress on Health (Medical) Informatics (Medinfo), 2007. pp. 1260–64. PMID: 17911917.

Sahoo SS, Bodenreider O, Zeng K, Sheth AP.  An experiment in integrating large biomedical knowledge resources with RDF: Application to associating genotype and phenotype information. International Workshop on Health Care and Life Sciences Data Integration for the Semantic Web, co-located with WWW2007, 2007.

Sahoo SS, Sheth A, Hunter B, York WS. SemBOWSER–Adding Semantics to biological Web services registry. Semantic Web: Revolutionizing Knowledge Discovery in the Life Sciences. Baker CJO, Cheung KO (Eds.), Springer, 2007. pp. 317–40.


Sahoo SS, Thomas C, Sheth AP, York WS, Tartir S.  Knowledge Modeling and Its Application in Life Sciences: A Tale of Two Ontologies. The 15th International World Wide Web (WWW) Conference, 2006. pp. 317-26

Sahoo SS, Sheth A. Bioinformatics applications of Web Services, Web Processes and role of Semantics. Semantic Web Processes and Their Applications. Cardoso J, Sheth A (Eds.), Springer, 2006. pp. 305–22.


Sahoo SS, Thomas C, Sheth AP, Henson C, York WS. GLYDE-An expressive XML standard for the representation of glycan structure. Carbohydrate Research, 2005. Vol. 340(18). pp.2802-7. PMID: 16242678

Atwood III J, Sahoo SS, Alvarez-Manilla G, Weatherly DB, Kolli K, Orlando R, York WS. Simple modification of a protein database for mass spectral identification of N-linked glycopeptides. Rapid Communications Mass Spectrometry, 2005. Vol. 19(21). pp.3002-6. PMID: 16196021

Alvarez-Manilla G, Atwood. III J, Sahoo SS, Guo Y, Warren NL, York WS, Orlando R, Pierce M.  Tools for glycoproteomic analysis: size-exclusion chromatography facilitates identification of tryptic glycopeptides with N-linked glycosylation site. Glycobiology 15(1208), 2005. PMID: 16512686

Aleman-Meza A, Halaschek-Wiener C, Sahoo SS, Sheth A, Arpinar B. Template Based Semantic Similarity for Security Applications. The IEEE Intl. Conference on Intelligence and Security Informatics (ISI-2005), 2005. pp: 621-622.

Sahoo SS, Sheth AP, York WS, Miller JA.  Semantic Web Services for N-glycosylation Process. International Symposium on Web Services for Computational Biology and Bioinformatics, 2005.


Sheth A, York WS, Thomas C, Nagarajan M, Miller JA, Kochut K, Sahoo SS, Yi X. Semantic Web technology in support of Bioinformatics for Glycan Expression. W3C Workshop on Semantic Web for Life Sciences, 2004.

Our Team

Satya Sahoo, PhD

Associate Professor of Medical Informatics

Nasim Shafiabadi, MD

Research Fellow

Arthur Gershon, PhD

Post-Doctoral Scholar

Chang Liu, MS Student

Research Assistant

Jianzhe Zhang, MS Student

Research Assistant

Our Alumni

contact info

phone 216-368-3286


street 2103 cornell road, room 6119
city, state cleveland, oh
building iris S. & bert l. wolstein research building
zipcode 44106-7291