AUSTIN, Texas — The much-anticipated University of Texas Data Repository (UTDR) named "Corral" is available to researchers at all 15 University of Texas System institutions, the Texas Advanced Computing Center (TACC) at The University of Texas at Austin announced today.
TACC Provides Multi-Petabyte Data Repository to Enable Research at University of Texas System Institutions
The data repository is part of the overall University of Texas Research Cyberinfrastructure (UTRC) project, a $23 million initiative announced in December 2010 to enable world-class research and foster stronger collaborations among researchers in Texas and around the world. The UTRC project ensures that researchers across Texas can effectively use advanced computing capabilities, including high-performance computing for simulation and analysis, high-capacity storage for large digital data collections, and high-bandwidth networking connecting institutions and resources.
As one of the largest online storage systems available to academic researchers in the United States, Corral provides six petabytes of storage, equivalent to 50 times the size of the entire collection of DVDs at Netflix. University of Texas System researchers whose data needs outstrip their local capacity are invited to apply for allocations on Corral using the Allocations Request System available through the TACC User Portal.
In recent years there has been an explosion of "big data" in science and engineering research. Big data is a term applied to data sets whose size is beyond the ability of commonly used software tools and commodity computers to manage. A single such data set can range from a few dozen terabytes to many petabytes. The data comes from many sources, such as gene sequencers, imaging systems, real-time sensors that monitor the environment, and computational simulations.
"Through the UTRC and UTDR effort, the UT System is bringing data infrastructure to researchers to match the massive computational infrastructure available through TACC," said Patricia Hurn, the system’s associate vice chancellor for health science research.
In January, TACC announced that the data repository, which will be composed of two identical installations -- one in Austin and one in Arlington -- would enter an early-user phase to encourage feedback and explore additional capabilities that researchers might need, such as data security, interfaces and capacity.
"Our work with early users from multiple academic and health campuses within the UT System has helped us to select tools for transferring and managing data in the system, and has shown the system to be stable and ready for wider use," said Chris Jordan, leader of the Data Management and Collections group at TACC and chair of the storage committee for the UTRC initiative. "As a result, we are now making the data repository open to all eligible UT System researchers."
The storage is directly accessible from TACC’s Lonestar 4 supercomputer, which was expanded last year as part of the UTRC initiative. In addition, all University of Texas institutions will eventually be connected at 10 gigabits per second to enable faster transfer of data to and from the data repository.
So far, the UTDR project has imported more than 200 terabytes of data to Corral, mostly next-generation sequencing and functional MRI data. The rate of data growth in UTDR is expected to increase as network connectivity is ramped up across all 15 institutions.
"Corral has been essential for us as we scale up output from our next-generation DNA sequencing center," said Scott Hunicke-Smith, director of diagnostics and genome sequencing at the Texas Institute for Drug and Diagnostic Development.
"The TACC installation has been so robust and well supported that we can use Corral to re-distribute data out to dozens of collaborator labs via many different interfaces. We also love Corral’s ability to assign arbitrary metadata to these large and valuable data files for searching, verification or provenance management," he said.
Researchers who need five terabytes of data storage or less will have free access during the first year, and researchers requiring more can purchase storage for $250 a terabyte per year. Some storage will also be set aside to support strategic, collaborative projects that enhance the leadership position of University of Texas System institutions.
Researchers may request allocations on Corral through a web page on the TACC User Portal. Training sessions on data management topics, including the use of Corral, will be provided on an ongoing basis. Please refer to the "Events and Training" section of TACC’s website.