| Newsletter Index | Quick-Tip Index | Search Newsletters |
As we discuss different configurations for our expected T3E, we are gathering user input. Our goal is to learn more about your MPP needs, both current and projected, to help guide our decisions.
Consider this: We face an inevitable tradeoff between number of PEs and amount of memory per PE. How do you come down on this?
Below (if you want to really get into it), are some "survey" questions. We would appreciate answers to these, along with your opinions on the "perfect" T3E configuration.
Contact Tom Baring at:
or 907-474-1899
denali$ mppview -L
_________________________________________________________________________
\ . . . . . . . . \
\ . . . . . . . . \
\________________________________________________________________________\
_________________________________________________________________________
\ . . . . . . . . \
\ . . . . . . . . \
\________________________________________________________________________\
_________________________________________________________________________
\ AAA AAA BBB BBB CCC CCC CCC CCC \
\ AAA AAA BBB BBB CCC CCC CCC CCC \
\________________________________________________________________________\
_________________________________________________________________________
\ AAA AAA BBB BBB CCC CCC CCC CCC \
\ AAA AAA BBB BBB CCC CCC CCC CCC \
\________________________________________________________________________\
Part User PID Program State Flags Shape- YZX (base) Elapsed
---- -------- ------ -------- ------ ------ ----------------------- ---------
26 AAA 61728 hib4A Active B 16= 2x 2x 4 (0x200) 1:05:25
37 CCC 63803 hib6B Active B 32= 2x 2x 8 (0x208) 0:27:39
48 BBB 55270 hib1C2 Active B 16= 2x 2x 4 (0x204) 3:12:26
denali$ qstat -a | grep pe_
-----------------------------
NQS 1.1 BATCH REQUEST SUMMARY
-----------------------------
IDENTIFIER NAME USER QUEUE JID PRTY REQMEM REQTIM ST
------------- ------- ----- --------------------- ---- ---- ------ ------ ---
99044.denali R011t AAA m_16pe_24h@denali 94875 20 3315 86361 R10
99033.denali R060t BBB m_16pe_24h@denali 94515 20 7405 86371 R10
98989.denali R128t CCC m_32pe_24h@denali 93635 20 9495 85565 R14
99048.denali runcp EEE m_32pe_24h@denali 499 16384 86400 Qqr
99003.denali runcp EEE m_64pe_24h@denali 286 16384 86400 Qce
98995.denali R013t AAA m_64pe_24h@denali 377 12288 86400 Qce
99037.denali run12 DDD m_128pe_8h@denali --- 2048 500 Qqs
There are a few reasons why this situation occurred and then perpetuated
itself:
As an aside, the "ST" column, above, is the job's status code. Here is an explanation of the codes in that column (see "man qstat" for other codes).
If the first status character is "R" then the job is running. If the "R" is followed by a number, then that is the number of currently active processes started by that request.
If the first character is "Q" then: "the request is in a queue and is eligible for routing or running." The two characters which follow indicate:
ce (Cray MPP systems only.) The complex Cray MPP
processing element (PE) limit was reached.
qr The queue run limit was reached.
qs The queue in which the request resides was stopped.
(The mpp_128pe_8h queue is "stopped" except from Fridays at 6:00 PM to
Sundays at 4:00 AM). See Newsletter #74 or execute the command "qstat
-m" for more information on the current T3D queue structure.
These two loader interfaces differ in command line inputs and default settings. One difference is in their treatment of unsatisfied external references. If a source program references externals which can not be found, mppldr will create an executable while mppld will not. This is the explanation for the "glitch" I encountered while linking MPICH v1.0.13 (which has two unsatisfied externals) and mentioned last week.
The "Cray MPP Loader User's Guide" (SG-2514 1.1) states:
The mppldr command provides a simple invocation method in which the loader handles many of the requirements of loading your program. The mppld command provides a traditional UNIX interface in which you must provide more information to the loader to load your program correctly.And:
Differences between In addition to differences in command-line
mppldr and mppld invocation formats, mppldr and mppld vary in other
2.6 ways. Table 3 summarizes these differences.
Table 3. mppldr and mppld differences
Feature mppldr mppld
Default directives file /lib/segdirs/mpp_def_seg /lib/segdirs/mpp_def_ld
Environment variable MPP_SEGDIR MPP_LDDIR
processing
Object file processing All object file All .o files
names are included (sequential object
as bin files. files) are included
as bin files. All
.a files (library
object files) are
included as lib
files.
Default setting of DUPENTRY=CAUTION DUPENTRY=CAUTION:
DUPENTRY directive :CAUTION:NOTE NOTE:NOTE.
Because of the
different dupentry
setting, and the
practice of
including library
object files as lib
files, mppld issues
fewer diagnostic
messages about
duplicated entry
point names than
mppldr.
Default setting of DUPORDER=OFF The DUPORDER=ON. An
DUPORDER directive first definition of ordered search
an entry point is algorithm is used.
chosen, regardless The entry point that
of the definition's mppld chooses depends
location. on the order of
definitions and
references. See
"DUPORDER
directive," for more
information.
Default system libraries A list of default No default libraries
libraries is are included. You
included. Most must specify all
common system libraries required
routines are by your program.
included in these
libraries.
Default setting for USX=CAUTION. A USX=WARNING. A
USX directive program that program that
contains unsatisfied contains unsatisfied
external references external references
is still executable is not executable
and mppldr exits and mppld exits
normally. Calls to with a nonzero error
unsatisfied status.
references are
intercepted when the
program is run.
Default setting for FORCE=OFF. Modules FORCE=ON. All
FORCE directive in bin files are modules encountered
included in the in bin files are
executable program included in the
only if they are executable program,
referenced, contain whether or not the
a main program, or modules are
initialize global referenced.
data.
It's probably not too smart to ignore compiler/linker warnings. You know
EXACTLY when your program will take a different execution path and hit
that unsatisfied external. (During a demo, of course.)
But anyway, here's a little example which uses unsatisfiable externals. Note that a similar FORTRAN program, compiled with cf77 (thus loaded by mppldr), is immediately executable.
/* goodbye.c */
#include <stdio.h>
main (int argc, char **argv) {
extern int Unsatisfied_External();
int i;
if (argc > 1)
i=Unsatisfied_External();
printf ("goodbye \n");
}
denali$ TARGET=cray-t3d; MPP_NPES=2
denali$ cc goodbye.c
[ Messages about unsatisfied externals and the statement:
mppldr-112 cc: WARNING
Because of previous errors, file 'a.out' is not executable. ]
denali$ a.out
ksh: a.out: cannot execute
denali$ chmod 700 a.out ; a.out
[ Messages about unsatisfied externals ... and then a correct run]
denali$ cc -c goodbye.c
denali$ mppldr goodbye.o
[ Messages about unsatisfied externals ... ]
denali$ a.out
[ Messages about unsatisfied externals ... and then a correct run]
denali$ a.out XXX
[ Messages about unsatisfied externals ... then a crash, because
it tries to execute "i=Unsatisfied_External()". ]
http://cuiwww.unige.ch/~ipps97/ | | 11th International Parallel Processing Symposium | 1-5 April 1997 | University of Geneva, Switzerland | | Important Dates: | | 20 September 1996 ..... Manuscripts Due | 13 December 1996 ..... Review Decisions Mailed | 20 January 1997 ..... Print Ready Paper Due | 30 August 1996 ..... Workshop Proposals Due | 31 October 1996 ..... Tutorial Proposals Due | 31 October 1996 ..... Commercial Exhibit Registration | | | Call for Participation: | | Sponsored by the Technical Committee on Parallel Processing, | the symposium is the committee's primary forum for engineers | and scientists from around the world to present their latest | research findings in the field. In addition to technical | sessions of submitted paper presentations, IPPS '97 will offer | workshops, tutorials, an industrial track, and commercial | exhibits. | | Our University of Geneva hosts are making arrangements for | on-campus housing as well as specially priced nearby hotel | accommodations. Also through the University, IPPS will be able | to provide daily luncheon in addition to the usual breaks and | refreshments. Full details will be available in the Advance | Program. | | Also in 1997, PARCON, the one-day Symposium on New Directions | in Parallel and Concurrent Computing, will co-locate with | IPPS. To accommodate their inclusion, workshops & tutorials | will be held the first and last days, papers will be presented | in technical sessions on the second and third days, and PARCON | will be presented on the fourth day.
My table headers were incorrect in two ways. The values are times (not "rates"), in microseconds, to transfer one entire buffer of the given size (not "value" of the given type). A sample correct header would look like this (changes are single-quoted):
Thanks go to the two readers who reported this.
http://www.cray.com/PUBLIC/product-info/T3E/CRAY_T3E.html
Contact:
Donald Bahls ARSC User Consultant ph: 907-450-8674 Ed Kornkven ARSC HPC Specialist ph: 907-450-8669 Arctic Region Supercomputing Center University of Alaska Fairbanks PO Box 756020 Fairbanks AK 99775-6020
Send comments and questions to the current editors using this Contact Form.E-mail Subscriptions:
| Newsletter Index | Quick-Tip Index | Search Newsletters |
Arctic Region Supercomputing Center
PO Box 756020, Fairbanks, AK 99775 | voice: 907-450-8600 | email:
home | search | about | support | news | science | resources