FINAL Technical Report DOE Award DE-SC0015527 [electronic resource]

Saved in:
Bibliographic Details
Online Access: Full Text (via OSTI)
Format: Government Document Electronic eBook
Language:English
Published: Oak Ridge, Tenn. : Distributed by the Office of Scientific and Technical Information, U.S. Department of Energy, 2019.
Subjects:

MARC

LEADER 00000nam a22000003u 4500
001 b11011301
003 CoU
005 20200310000000.0
006 m o d f
007 cr |||||||||||
008 200416e20190214||| o| f0|||||eng|d
035 |a (TOE)ost1593835 
035 |a (TOE)1593835 
040 |a TOE  |c TOE 
049 |a GDWR 
086 0 |a E 1.99:1593835 
086 0 |a E 1.99:1593835 
245 0 0 |a FINAL Technical Report DOE Award DE-SC0015527  |h [electronic resource] 
260 |a Oak Ridge, Tenn. :  |b Distributed by the Office of Scientific and Technical Information, U.S. Department of Energy,  |c 2019. 
300 |a Medium: ED :  |b digital, PDF file. 
336 |a text  |b txt  |2 rdacontent. 
337 |a computer  |b c  |2 rdamedia. 
338 |a online resource  |b cr  |2 rdacarrier. 
500 |a Published through Scitech Connect. 
500 |a 02/14/2019. 
500 |a Newman, Harvey B.; Vlimant, Jean-Roch; Balcas, Justas; Puttaswamy, Shashwitha;  
500 |a California Institute of Technology (CalTech), Pasadena, CA (United States) 
500 |a USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21) 
520 3 |a This project aims to enable the LHC and other leading programs in high energy physics and other global science domains funded by the DOE to operate with a new level of efficiency and control, through the development of a Next Generation Integrated Architecture (NGenIA) based on intelligent software defined network (SDN)-driven network systems coupled to high throughput applications. While the initial focus is on the challenging LHC use case, the systems and products being developed are general, and apply to many fields of data intensive science ranging from astrophysical sky surveys to bioinformatics and earth observation, as well as other organizations facing the challenges of extracting knowledge from distributed multi-petabyte data stores. A central concept in this development program is a new paradigm ?consistent network operations? among widely distributed computing and storage facilities, where stable high throughput flows, at set rates, cross load-balanced network paths, up to flexible high water marks that are adjusted in real time to accommodate other network traffic. The large smooth flows are launched and managed by SDN services that act in concert with the experiments? site-resident data distribution and management systems, to meet the expanding needs of the science programs. The technical goals include the construction of autonomous, intelligent site-resident services that interact dynamically with network-resident services, and with the science programs? principal data distribution and management tools, to request or command network resources in support of high throughput petascale to exascale workflow Specific work items include: 1. Developing compact Data Transfer Nodes (DTNs) with auto-tuning functions that support data transfer rates in the 100 Gbps to the 1 Tbps range when used with high throughput data transfer applications 2. Deep site orchestration among virtualized clusters, storage subsystems and subnets to successfully coschedule CPU, storage and network resources 3. Science-program designed site architectures, operational modes, and policy and resource usage priorities, adjudicated across multiple network domains and virtual organizations, using the orchestration functions and methods 4. Seamlessly extending end-to-end operation across both extra-site and intra-site boundaries through the use of Open vSwitch (OVS), FireQoS + next generation Science DMZs 5. Novel methods of system integration that enable granular control of extreme scale long distance transfers through flow matching of scattered source-destination address pairs to multi-domain dynamic circuits 6. Funneling massive sets of streams to DTNs at the site edge hosting petascale buffer pools configured for flows of 100 Gbps and up, exploiting state of the art data transfers 7. Adaptive scheduling based on pervasive end-to-end monitoring, including DTN or compute-node resident agents providing comprehensive end-system profiling 8. Developing unsupervised and supervised machine learning and modeling methods to optimize the workflow involving terabyte to multi-petabyte datasets. 
536 |b SC0015527. 
650 7 |a 97 mathematics and computing  |2 local. 
650 7 |a 72 physics of elementary particles and fields  |2 local. 
650 7 |a 47 other instrumentation  |2 local. 
650 7 |a Software defined networking global distributed systems lhc and lsst computing and networking  |4 spn  |2 local. 
710 1 |a United States.  |b Department of Energy.  |b Office of Scientific and Technical Information  |4 dst. 
856 4 0 |u http://www.osti.gov/servlets/purl/1593835  |z Full Text (via OSTI) 
907 |a .b110113019  |b 11-30-21  |c 05-11-20 
998 |a web  |b 05-11-20  |c f  |d m   |e p  |f eng  |g    |h 0  |i 1 
956 |a Information bridge 
999 f f |i e1ba14e6-9946-59c9-b59e-b57a5e3e8501  |s f24073d7-be12-59ec-bda8-2a38c7611fe7 
952 f f |p Can circulate  |a University of Colorado Boulder  |b Online  |c Online  |d Online  |e E 1.99:1593835  |h Superintendent of Documents classification  |i web  |n 1