{"id":485,"date":"2016-08-22T11:14:48","date_gmt":"2016-08-22T15:14:48","guid":{"rendered":"https:\/\/www.bumc.bu.edu\/genetics\/?page_id=485"},"modified":"2020-01-08T17:40:30","modified_gmt":"2020-01-08T22:40:30","slug":"computational-resources","status":"publish","type":"page","link":"https:\/\/www.bumc.bu.edu\/genetics\/about\/computational-resources\/","title":{"rendered":"Computational Resources"},"content":{"rendered":"<p>For their compute- and data-intensive research, Biomedical Genetics researchers have access to the\u00a0<strong>Boston University Shared Computing Cluster<\/strong>\u00a0(SCC). The SCC is located in Holyoke, MA, site of the LEED Platinum certified\u00a0<a href=\"http:\/\/www.bu.edu\/tech\/support\/research\/rcs\/mghpcc\/\">Massachusetts Green High Performance Computing Center (MGHPCC)<\/a>, where energy is plentiful, clean, and inexpensive. Two pairs of 10 Gigabit Ethernet connections between the MGHPCC and the BU campus provide fast data transfer between the two locations. The SCC is a heterogeneous Linux cluster composed of both\u00a0<em>shared<\/em>\u00a0and\u00a0<em>buy-in<\/em> components. The system currently includes over 2600 shared processors, over 5100\u00a0<a href=\"http:\/\/www.bu.edu\/tech\/support\/research\/computing-resources\/service-models\/buy-in\/\">buy-in<\/a>\u00a0processors, a combined 244 GPUs, and over two petabytes of storage for research data (approximately 75% of which is buy-in storage). The SCC is suited to the compute- and storage-intensive analyses required for bioinformatics and genomics research. A detailed summary of the SCC resources can be found <a href=\"https:\/\/www.bu.edu\/tech\/support\/research\/computing-resources\/tech-summary\/\">here<\/a>. 
Research Computing installs and maintains an extensive set of bioinformatics, genomic, and statistical software modules that support a wide range of processing and analyses.<\/p>\n<p style=\"text-align: center;\"><a href=\"\/genetics\/files\/2016\/08\/Computing-Center-John-F.-pic-8.22.16.jpg\"><img loading=\"lazy\" width=\"200\" height=\"270\" class=\" size-full wp-image-1851 aligncenter\" alt=\"Computing Center John F. pic 8.22.16\" src=\"\/genetics\/files\/2016\/08\/Computing-Center-John-F.-pic-8.22.16.jpg\" \/><\/a><\/p>\n<p style=\"text-align: left;\">In addition, for scalable, data-intensive processing of large sequencing and variant files, a Shared Hadoop Cluster that runs Apache Spark is also available. <a href=\"http:\/\/spark.apache.org\/docs\/latest\/index.html\">Apache Spark<\/a> is a fast, general-purpose cluster computing system with high-level APIs in Java, Scala, Python, and R. Several higher-level tools are available to scale analyses: <a href=\"http:\/\/spark.apache.org\/docs\/latest\/sql-programming-guide.html\">Spark SQL<\/a>\u00a0for SQL and structured data processing,\u00a0<a href=\"http:\/\/spark.apache.org\/docs\/latest\/ml-guide.html\">MLlib<\/a>\u00a0for machine learning,\u00a0<a href=\"http:\/\/spark.apache.org\/docs\/latest\/graphx-programming-guide.html\">GraphX<\/a>\u00a0for graph processing, and\u00a0<a href=\"https:\/\/spark.apache.org\/docs\/latest\/sparkr.html\">SparkR<\/a> for statistical analyses. GATK 4, which has been redesigned for Spark, will soon be available for variant discovery in high-throughput sequencing (HTS) data; it takes advantage of the Spark distributed computing framework to speed up robust pipelines for genomic research.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>For their compute- and data-intensive research, Biomedical Genetics researchers have access to the\u00a0Boston University Shared 
Computing Cluster\u00a0(SCC). The SCC is located in Holyoke, MA, site of the LEED Platinum certified\u00a0Massachusetts Green High Performance Computing Center (MGHPCC), where energy is plentiful, clean, and inexpensive. Two pairs of 10 Gigabit Ethernet connections between the MGHPCC and the [&hellip;]<\/p>\n","protected":false},"author":1162,"featured_media":0,"parent":481,"menu_order":1,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/www.bumc.bu.edu\/genetics\/wp-json\/wp\/v2\/pages\/485"}],"collection":[{"href":"https:\/\/www.bumc.bu.edu\/genetics\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.bumc.bu.edu\/genetics\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.bumc.bu.edu\/genetics\/wp-json\/wp\/v2\/users\/1162"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bumc.bu.edu\/genetics\/wp-json\/wp\/v2\/comments?post=485"}],"version-history":[{"count":15,"href":"https:\/\/www.bumc.bu.edu\/genetics\/wp-json\/wp\/v2\/pages\/485\/revisions"}],"predecessor-version":[{"id":1853,"href":"https:\/\/www.bumc.bu.edu\/genetics\/wp-json\/wp\/v2\/pages\/485\/revisions\/1853"}],"up":[{"embeddable":true,"href":"https:\/\/www.bumc.bu.edu\/genetics\/wp-json\/wp\/v2\/pages\/481"}],"wp:attachment":[{"href":"https:\/\/www.bumc.bu.edu\/genetics\/wp-json\/wp\/v2\/media?parent=485"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}