MCSR_logo.jpg (56K)
Home My MCSR Supercomputers Software Research Education
Login
Issues/Circulation
Current Issue: June 2006

Past Issues

Education
Parallel Prgrmng. Class

MCSR in the Classroom

Research
MCSR at MAS 2006

SC05 Education Pgm.

Verify Your Grants

MyMCSR on the Web Introducing MyMCSR

Supercomputers
HW Upgrades for June 06

Redwood SUSE Upgrade

Software
Matlab Memory Issue

CPMD on Redwood

X-Win32 Upgrade

Abaqus 6.5 Installed

Amber 8 on Redwood



MCSRLogo2 (17K)

Volume 5, Number 1
May 2006

Amber Version 8 on Redwood

by Taner Pirim

The optimized version of Amber ver.8 is now installed on redwood at /usr/local/appl/Amber8 by using Intel Math Kernel Library, MKL, version 7.2.1 as well as Amber ver. 7. Also, Amber 8 is installed on mimosa cluster. The performances of Amber 8 and Amber 7 on Redwood and Mimosa have been analyzed for various numbers of cpus together with Dr. Randy M. Wadkins. The results obtained from the benchmark sets were compared. Amber 8/7 benchmarks were ran for 2, 4, 8, 16, and 32 cpus and the wallclock times obtained from the output files.

As shown in the figure above, Amber 7 and 8 benchmarks on Redwood outperformed the benchmark results of Amber 8 on mimosa for all the cpu parallelization. Redwood finished the runs at least 2.5 times faster than mimosa. On less than 8 cpu jobs, redwood completed the runs an average of approximately five times faster than mimosa. Also, Amber 8 performed slightly better than Amber 7 on redwood for all the runs.

 
Wallclock Time (secs)
Speed up1
# of CPU Mimosa Amber8 Redwood Amber7 Redwood Amber8 Redwood Amber7 Redwood Amber8
2 663.20 262.26 193.57 2.53 3.43
4 460.90 140.82 106.85 4.71 6.21
8 288.82 97.40 80.94 6.81 8.19
16 247.76 52.17 42.70 12.71 15.53
32 220.96 44.19 39.32 15.01 16.87

1Speed up is calculated by Mimosa Time / Redwood Time

When the speed-up for each run calculated by dividing the runtimes of Redwood to Mimosa runtimes, the speed-ups for each run were better when the numbers of cpus were increased as shown in above Table. Thus, when the speed-ups of 8 and 16 cpu runs were compared, the 16 cpu jobs completed the runs in half the time than 8 cpu jobs. However, 32 cpu speed-ups showed an average of 11% improvement than 16 cpu speed-ups even though the number of cpus used were doubled. Thus, for the benchmarks the best efficiency were obtained by running the benchmarks on 16 processors. If you would like more information on the versions of Amber installed on our systems, please visit our Amber website. For more questions on Amber, please contact us via assist@mcsr.olemiss.edu.


Last Modified:June 08, 2007 10:31:45.   Copyright © 1997-2012 The Mississippi Center for Supercomputing Research. All Rights Reserved.   The University of Mississippi
Valid RSS