{"id":102344,"date":"2022-03-08T13:32:42","date_gmt":"2022-03-08T18:32:42","guid":{"rendered":"http:\/\/www.bumc.bu.edu\/busm\/?p=102344"},"modified":"2022-03-08T13:37:13","modified_gmt":"2022-03-08T18:37:13","slug":"data-science-core-opens-for-business","status":"publish","type":"post","link":"https:\/\/www.bumc.bu.edu\/camed\/2022\/03\/08\/data-science-core-opens-for-business\/","title":{"rendered":"Data Science Core Opens for Business"},"content":{"rendered":"<p><img loading=\"lazy\" src=\"\/camed\/files\/2022\/03\/Data-Science-Core-e1646688051991.jpg\" alt=\"Data Science Visual graphic\" class=\"size-full wp-image-102347 alignleft\" width=\"300\" height=\"192\" \/>Two years ago faculty, and particularly the genomics research community, at BUSM asked for help with data analysis. Dean Karen Antman, MD,\u00a0 held a data science workshop in February 2020 and the result was a decision to create a service core staffed by faculty scientists to provide the level of analysis and bioinformatics (big data analysis of large data sets using computer and statistical analyses on biological data) that scientists needed to further their research.<\/p>\n<p>The COVID-19 pandemic put that effort on hold until this past September when the search for data science faculty was reopened.<\/p>\n<figure id=\"attachment_102369\" aria-describedby=\"caption-attachment-102369\" style=\"width: 310px\" class=\"wp-caption alignright\"><img loading=\"lazy\" src=\"\/camed\/files\/2022\/03\/Zhang_Chao-scaled-e1646763077619.jpg\" alt=\"Chao Zhang standing with arms folded\" class=\"size-full wp-image-102369\" width=\"300\" height=\"409\" \/><figcaption id=\"caption-attachment-102369\" class=\"wp-caption-text\">Chao Zhang<\/figcaption><\/figure>\n<p>The result: the Applied Data Science Core (ADSC) opened for business with the launch of its <a href=\"https:\/\/www.bumc.bu.edu\/camed\/research\/cores\/data-science-core-dsc\/\">website<\/a> and its first hire, Assistant Professor of Computational Biomedicine Chao Zhang, PhD. Associate Professor of Computational Biomedicine Ignaty Leshchiner, PhD, will join Zhang this May.<\/p>\n<p>\u201cThe goal is to have on campus the analytical capabilities needed to support and advance the biomedical research of the BUSM faculty,\u201d said Andrew W. Taylor, PhD, BUSM associate dean for research. \u201cBioinformatics and biostatistical programmers know how to handle large data sets.\u201d<\/p>\n<p>Taylor said that a secondary goal was for the ADSC faculty to conduct their own research into their specialties.<\/p>\n<p>\u201cApplying data sciences is foundational for advancing our field, and so, we can\u2019t wait to collaborate with the new core,\u201d said Darrell Kotton, MD, the David C. Seldin Professor of Medicine and director of the BU\/BMC Center for Regenerative Medicine (CReM).<\/p>\n<p>The new service complements BU President Robert Brown\u2019s vision of the University becoming a leader in the data science field, realized in the ongoing construction of a new Center for Computing &amp; Data Sciences on Commonwealth Avenue.<\/p>\n<p>\u201cThere\u2019s a lot of excitement at BU for data science,\u201d said Nelson Lau, PhD, associate professor of biochemistry and director of BU\u2019s Genome Sciences Institute (GSI).<\/p>\n<p>\u201cIn the biomedicine field we are awash in genomic and sequencing data. Even labs that never managed data before find they need to in order to stay current and competitive,\u201d said Lau.<\/p>\n<p>\u201cAnyone doing animal or disease research requires genetic sequencing that can involve gigabytes, even terabytes, of data to map diseases back to the genomes and compare healthy to diseased to get to even a fundamental understanding of what you are working with,\u201d Lau said.<\/p>\n<p>He is already working with Zhang on a project in which they analyze RNA data sequenced from mosquitos collected at field stations in Connecticut to see what other viruses, besides the commonly known ones like West Nile and Eastern Equine Encephalitis, might be present.<\/p>\n<p>While Lau is trained in bioinformatics and his lab already has two bioinformatics specialists, he said he can\u2019t help everyone who needs this specialized data analysis.<\/p>\n<p>\u201cMany PI\u2019s (principal investigators) like myself have to take care of our own research first,\u201d said Lau, who worked with Taylor and Antman with a goal of democratizing access.<\/p>\n<p>\u201cEven with centers as successful as the CReM and GSI where we have our own bioinformatics groups, it would be great to see more labs at BUSM have access to this type of analysis too,\u201d said Lau.<\/p>\n<p>Before this new service, BUSM researchers without direct access to data analysis specialists either found another researcher to help them or contracted with an outside company.<\/p>\n<p>\u201cInstead of sending (their data) to an outside company, we have the opportunity to interact with them to get a better result,\u201d said Zhang, who has a doctorate in computer science and a masters\u2019 degree in statistics.<\/p>\n<p>\u201cTypically, for-profit companies just do basic analysis for everyone. We are researchers and we will try to help them interpret and understand the data, the deeper background of the whole story,\u201d said Zhang. \u201cThe ultimate goal is to use technology on large scale data to solve those complicated questions.\u201d<\/p>\n<p>His specialty is computational biomedicine, the application of computer methodology to help in the diagnosis and treatment of disease. Zhang said one of the applications could be genomics analysis of a tumor sample to find a better treatment specific to that patient.<\/p>\n<p>\u201cA lot of people do data science projects that are data-heavy,\u201d said Vasan Ramachandran, MD, professor of medicine and epidemiology at BUSM and SPH, and principal investigator and director of the Framingham Heart Study (FHS). He compared the new data center to a community pool where a lot of people could swim without the expense of building individual pools.<\/p>\n<p>\u201cHaving a centralized data core would cater to a larger group of researchers where they could find one-stop expertise. The ADSC would be able to work together with researchers to really meet what has heretofore been an unmet need,\u201d said Ramachandran, who believed there may be as many unfunded as there are funded research projects underway at BUSM.<\/p>\n<p>Being able to do data analysis in-house may also help in getting grants, he said. \u201cThey assume you have the infrastructure.\u201d<\/p>\n<p>\u201cI think it will be helpful in applying for grants,\u201d said Lau. \u201cThere\u2019s just a higher chance of (the project) succeeding. Instead of begging for help from another institution, you can just go down the hall and talk to someone on the same team. It\u2019s a more personalized experience.\u201d<\/p>\n<p>While it is not yet in the ADSC toolkit, Ramachandran looks forward to computerized data analysis that can combine data from heterogenous sources. For example, Ramachandran\u2019s RURAL Cohort Study is evaluating the underlying risks in rural areas.<\/p>\n<p>\u201cWhy are people in rural areas challenged in terms of their health?\u201d he asked. The answer may lie in a combination of reasons that involve varied data sources like genetics, diet, medical and health records, that can be stored in entirely different formats.<\/p>\n<p>\u201cThe underlying architecture of the data is different befitting the multi-dimensional nature of the disease,\u201d Ramachandran said. The information is siloed but understanding risk and building health takes a multi-faceted approach and there isn\u2019t one expert who can coordinate the data, he said.<\/p>\n<p>\u201cThe ASDC Core hopes to provide a resource with complementary expertise in different domains,\u201d said Ramachandran.<\/p>\n<p>Taylor, who oversees the running of BUSM cores, said the ASDC will be initially funded by Dean\u2019s Office.<\/p>\n<p>\u201cAs time goes by it should be supported by subcontracts and multiple principal investigator grants,\u201d said Taylor, with the possibility of additional funding coming through data science instruction and student training.<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Its goal is to have analytical capabilities on campus that are needed to support and advance the biomedical research of  BUSM faculty.<\/p>\n","protected":false},"author":903,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[146,91,90],"tags":[],"_links":{"self":[{"href":"https:\/\/www.bumc.bu.edu\/camed\/wp-json\/wp\/v2\/posts\/102344"}],"collection":[{"href":"https:\/\/www.bumc.bu.edu\/camed\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.bumc.bu.edu\/camed\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.bumc.bu.edu\/camed\/wp-json\/wp\/v2\/users\/903"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bumc.bu.edu\/camed\/wp-json\/wp\/v2\/comments?post=102344"}],"version-history":[{"count":10,"href":"https:\/\/www.bumc.bu.edu\/camed\/wp-json\/wp\/v2\/posts\/102344\/revisions"}],"predecessor-version":[{"id":102375,"href":"https:\/\/www.bumc.bu.edu\/camed\/wp-json\/wp\/v2\/posts\/102344\/revisions\/102375"}],"wp:attachment":[{"href":"https:\/\/www.bumc.bu.edu\/camed\/wp-json\/wp\/v2\/media?parent=102344"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.bumc.bu.edu\/camed\/wp-json\/wp\/v2\/categories?post=102344"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.bumc.bu.edu\/camed\/wp-json\/wp\/v2\/tags?post=102344"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}