In unstructured finite volume method,loop on different mesh components such as cells,faces,nodes,etc is used widely for the traversal of *** loop results in direct or indirect data access that affects data locality **...
详细信息
In unstructured finite volume method,loop on different mesh components such as cells,faces,nodes,etc is used widely for the traversal of *** loop results in direct or indirect data access that affects data locality *** loop on mesh,many threads accessing the same data lead to data *** data locality and data dependence play an important part in the performance of GPU *** optimizing a GPU-accelerated unstructured finite volume Computational Fluid Dynamics(CFD)program,the performance of hot spots under different loops on cells,faces,and nodes is evaluated on Nvidia Tesla V100 and *** tests under different mesh scales show that the effects of mesh loop modes are different on data locality and data ***,face loop makes the best data locality,so long as access to face data exists in *** loop brings the smallest overheads due to non-coalescing data access,when both cell and node data are used in computing without face *** loop owns the best performance in the condition that only indirect access of cell data exists in *** operations reduced the performance of kernels largely in K80,which is not obvious on *** the suitable mesh loop mode in all kernels,the overall performance of GPU simulations can be increased by 15%-20%.Finally,the program on a single GPU V100 can achieve maximum 21.7 and average 14.1 speed up compared with 28 MPI tasks on two Intel CPUs Xeon Gold 6132.
Several recent business reports have described the global growth in demand for optical and photonic components,paralleled by technical reports on the growing shortage of skilled manufacturing staff to meet this *** is...
详细信息
Several recent business reports have described the global growth in demand for optical and photonic components,paralleled by technical reports on the growing shortage of skilled manufacturing staff to meet this *** is remarkable that producing ultraprecision surfaces remains so dependent on people,in contrast to other sectors of the economy,e.g.,car ***,training can play some role,but ultimately,only process automation can provide the *** paper explores why automation is a challenge and summarizes multidisciplinary work aiming to assemble the building blocks required to realize automation.
The unique and unanticipated properties of multiple principal component alloys have reinvigorated the field of alloy design and drawn strong interest across scientific *** vast compositional parameter space makes thes...
详细信息
The unique and unanticipated properties of multiple principal component alloys have reinvigorated the field of alloy design and drawn strong interest across scientific *** vast compositional parameter space makes these alloys a unique area of exploration by means of computational ***,as of now a method to compute efficiently,yet with high accuracy the thermodynamic properties of such alloys has been *** of the underlying reasons is the lack of accurate and efficient approaches to compute vibrational free energies—including anharmonicity—for these chemically complex multicomponent *** this work,a density-functional-theory based approach to overcome this issue is developed based on a combination of thermodynamic integration and a machine-learning *** demonstrate the performance of the approach by computing the anharmonic free energy of the prototypical five-component VNbMoTaW refractory high entropy alloy.
暂无评论