octave-nkf: liboctave/UMFPACK/UMFPACK/Source/umf

author	jwe
date	Fri, 25 Feb 2005 19:55:28 +0000
parents
children

rev	line source
5164 57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	1 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	2 /* === umf_config.h ========================================================= */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	3 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	4
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	5 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	6 /* UMFPACK Version 4.4, Copyright (c) 2005 by Timothy A. Davis. CISE Dept, */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	7 /* Univ. of Florida. All Rights Reserved. See ../Doc/License for License. */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	8 /* web: http://www.cise.ufl.edu/research/sparse/umfpack */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	9 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	10
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	11 /*
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	12 This file controls the compile-time configuration of UMFPACK. Modify the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	13 Makefile, the architecture-dependent Make.* file, and this file if
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	14 necessary, to control these options. The following flags may be given
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	15 as options to your C compiler (as in "cc -DNBLAS", for example). These
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	16 flags are normally placed in your CONFIG string, defined in your Make.*.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	17
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	18 All of these options, except for the timer, are for accessing the BLAS.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	19
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	20 -DNBLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	21
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	22 BLAS mode. If -DNBLAS is set, then no BLAS will be used. Vanilla
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	23 C code will be used instead. This is portable, and easier to
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	24 install, but you won't get the best performance.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	25
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	26 If -DNBLAS is not set, then externally-available BLAS routines
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	27 (dgemm, dger, and dgemv or the equivalent C-BLAS routines) will be
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	28 used. This will give you the best performance, but perhaps at the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	29 expense of portability.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	30
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	31 The default is to use the BLAS, for both the C-callable libumfpack.a
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	32 library and the MATLAB mexFunction. If you have trouble installing
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	33 UMFPACK, set -DNBLAS (but then UMFPACK will be slow).
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	34
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	35 -DCBLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	36
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	37 If -DCBLAS is set, then the C-BLAS interface to the BLAS is
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	38 used. If your vendor-supplied BLAS library does not have a C-BLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	39 interface, you can obtain the ATLAS BLAS, available at
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	40 http://www.netlib.org/atlas.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	41
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	42 This flag is ignored if -DNBLAS is set.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	43
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	44 -DLP64
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	45
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	46 This should be defined if you are compiling in the LP64 model
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	47 (32 bit int's, 64 bit long's, and 64 bit pointers). In Solaris,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	48 this is obtained with the flags -xtarget=ultra -xarch=v9 for
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	49 the cc compiler (for example).
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	50
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	51 -DLONGBLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	52
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	53 If not defined, then the BLAS are not called in the long integer
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	54 version of UMFPACK (the umfpack_l_ routines). The most common
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	55 definitions of the BLAS, unfortunately, use int arguments, and
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	56 are thus not suitable for use in the LP64 model. Only the Sun
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	57 Performance Library, as far as I can tell, has a version of the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	58 BLAS that allows long integer (64-bit) input arguments. This
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	59 flag is set automatically in Sun Solaris if you are using the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	60 Sun Performance BLAS. You can set it yourself, too, if your BLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	61 routines can take long integer input arguments.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	62
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	63 -DNSUNPERF
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	64
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	65 Applies only to Sun Solaris. If -DNSUNPERF is set, then the Sun
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	66 Performance Library BLAS will not be used.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	67
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	68 The Sun Performance Library BLAS is used by default when compiling
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	69 the C-callable libumfpack.a library on Sun Solaris.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	70
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	71 This flag is ignored if -DNBLAS is set.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	72
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	73 -DNSCSL
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	74
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	75 Applies only to SGI IRIX. If -DSCSL is set, then the SGI SCSL
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	76 Scientific Library BLAS will not be used.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	77
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	78 The SGI SCSL Scientific Library BLAS is used by default when
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	79 compiling the C-callable libumfpack.a library on SGI IRIX.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	80
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	81 This flag is ignored if -DNBLAS is set.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	82
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	83 -DNPOSIX
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	84
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	85 If -DNPOSIX is set, then your Unix operating system is not POSIX-
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	86 compliant, and the POSIX routines sysconf ( ) and times ( )
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	87 routines are not used. These routines provide CPU time and
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	88 wallclock time information. If -DNPOSIX is set, then the ANSI
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	89 C clock ( ) routine is used. If -DNPOSIX is not set, then
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	90 sysconf ( ) and times ( ) are used in umfpack_tic and umfpack_toc.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	91 See umfpack_tictoc.c for more information.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	92 The default is to use the POSIX routines, except for Windows,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	93 which is not POSIX-compliant.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	94
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	95 -DGETRUSAGE
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	96
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	97 If -DGETRUSAGE is set, then your system's getrusage ( ) routine
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	98 will be used for getting the process CPU time. Otherwise the ANSI
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	99 C clock ( ) routine will be used. The default is to use getrusage
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	100 ( ) on Unix systems, and to use clock on all other architectures.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	101
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	102 -DNO_TIMER
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	103
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	104 If -DNO_TIMER is set, then no timing routines are used at all.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	105
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	106 -DNUTIL
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	107
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	108 If -DNUTIL is set, then the internal MATLAB utMalloc, utFree, and
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	109 utRealloc routines are not used in the UMFPACK mexFunction. The
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	110 regular mxMalloc, mxFree, and mxRealloc routines are used instead.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	111 These routines are not documented, but are available for use. For
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	112 Windows, -DNUTIL is defined below, because access to the ut*
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	113 routines is not available by default.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	114
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	115 -DNRECIPROCAL
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	116
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	117 This option controls a tradeoff between speed and accuracy. Using
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	118 -DNRECIPROCAL can lead to more accurate results, but with perhaps
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	119 some cost in performance, particularly if floating-point division
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	120 is much more costly than floating-point multiplication.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	121
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	122 This option determines the method used to scale the pivot column.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	123 If set, or if the absolute value of the pivot is < 1e-12 (or is a
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	124 NaN), then the pivot column is divided by the pivot value.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	125 Otherwise, the reciprocal of the pivot value is computed, and the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	126 pivot column is multiplied by (1/pivot). Multiplying by the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	127 reciprocal can be slightly less accurate than dividing by the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	128 pivot, but it is often faster. See umf_scale.c.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	129
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	130 This has a small effect on the performance of UMFPACK, at least on
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	131 a Pentium 4M. It may have a larger effect on other architectures
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	132 where floating-point division is much more costly than floating-
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	133 point multiplication. The RS 6000 is one such example.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	134
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	135 By default, the method chosen is to multiply by the reciprocal
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	136 (sacrificing accuracy for speed), except when compiling UMFPACK
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	137 as a built-in routine in MATLAB, or when gcc is being used.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	138
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	139 When MATHWORKS is defined, -DNRECIPROCAL is forced on, and the pivot
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	140 column is divided by the pivot value. The only way of using the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	141 other method in this case is to edit this file.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	142
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	143 If -DNRECIPROCAL is enabled, then the row scaling factors are always
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	144 applied by dividing each row by the scale factor, rather than
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	145 multiplying by the reciprocal. If -DNRECIPROCAL is not enabled
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	146 (the default case), then the scale factors are normally applied by
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	147 multiplying by the reciprocal. If, however, the smallest scale
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	148 factor is tiny, then the scale factors are applied via division.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	149
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	150 -DNO_DIVIDE_BY_ZERO
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	151
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	152 If the pivot is zero, and this flag is set, then no divide-by-zero
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	153 occurs.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	154
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	155 You should normally not set these flags yourself:
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	156
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	157 -DBLAS_BY_VALUE if scalars are passed by value, not reference
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	158 -DBLAS_NO_UNDERSCORE if no underscore should be appended
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	159 -DBLAS_CHAR_ARG if BLAS options are single char's, not strings
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	160
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	161 The BLAS options are normally set automatically. If your
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	162 architecture cannot be determined (see UMFPACK_ARCHITECTURE, below)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	163 then you may need to set these flags yourself.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	164
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	165 The following options are controlled by amd_internal.h:
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	166
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	167 -DMATLAB_MEX_FILE
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	168
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	169 This flag is turned on when compiling the umfpack mexFunction for
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	170 use in MATLAB. When compiling the MATLAB mexFunction, the MATLAB
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	171 BLAS are used (unless -DNBLAS is set). The -DCBLAS, -DNSCSL, and
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	172 -DNSUNPERF flags are all ignored. The -DNRECIPROCAL flag is
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	173 forced on. Otherwise, [L,U,P,Q,R] = umfpack (A) would return
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	174 either LU = P(R\A)Q or LU = PRA*Q. Rather than returning a
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	175 flag stating how the scale factors R are to be applied, the umfpack
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	176 mexFunction always takes the more accurate route and returns
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	177 LU = P(R\A)*Q.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	178
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	179 -DMATHWORKS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	180
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	181 This flag is turned on when compiling umfpack as a built-in routine
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	182 in MATLAB. The MATLAB BLAS are used for all architectures (-DNBLAS,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	183 -DCBLAS, -DNSCSL, and -DNSUNPERF flags are all ignored). Internal
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	184 routines utMalloc, utFree, utRealloc, utPrintf, utDivideComplex,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	185 and utFdlibm_hypot are used, and the "util.h" file is included.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	186 This avoids the problem discussed in the User Guide regarding memory
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	187 allocation in MATLAB. utMalloc returns NULL on failure, instead of
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	188 terminating the mexFunction (which is what mxMalloc does). However,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	189 the ut* routines are not documented by The MathWorks, Inc., so I
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	190 cannot guarantee that you will always be able to use them.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	191 The -DNRECIPROCAL flag is turned on.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	192
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	193 -DNDEBUG
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	194
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	195 Debugging mode (if NDEBUG is not defined). The default, of course,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	196 is no debugging. Turning on debugging takes some work (see below).
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	197 If you do not edit this file, then debugging is turned off anyway,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	198 regardless of whether or not -DNDEBUG is specified in your compiler
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	199 options.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	200 */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	201
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	202 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	203 /* === AMD configuration ==================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	204 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	205
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	206 /* NDEBUG, PRINTF defined in amd_internal.h */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	207
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	208 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	209 /* === reciprocal option ==================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	210 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	211
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	212 /* Force the definition NRECIPROCAL when MATHWORKS or MATLAB_MEX_FILE
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	213 * are defined. Do not multiply by the reciprocal in those cases. */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	214
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	215 #ifndef NRECIPROCAL
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	216 #if defined (MATHWORKS) \|\| defined (MATLAB_MEX_FILE)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	217 #define NRECIPROCAL
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	218 #endif
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	219 #endif
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	220
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	221 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	222 /* === Microsoft Windows configuration ====================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	223 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	224
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	225 #ifdef UMF_WINDOWS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	226 /* Windows can't access the ut* routines, and it isn't Unix. */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	227 #define NUTIL
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	228 #define NPOSIX
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	229 #endif
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	230
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	231 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	232 /* === 0-based or 1-based printing ========================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	233 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	234
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	235 #if defined (MATLAB_MEX_FILE) && defined (NDEBUG)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	236 /* In MATLAB, matrices are 1-based to the user, but 0-based internally. */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	237 /* One is added to all row and column indices when printing matrices */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	238 /* for the MATLAB user. The +1 shift is turned off when debugging. */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	239 #define INDEX(i) ((i)+1)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	240 #else
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	241 /* In ANSI C, matrices are 0-based and indices are reported as such. */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	242 /* This mode is also used for debug mode, and if MATHWORKS is defined rather */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	243 /* than MATLAB_MEX_FILE. */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	244 #define INDEX(i) (i)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	245 #endif
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	246
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	247 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	248 /* === Timer ================================================================ */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	249 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	250
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	251 /*
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	252 If you have the getrusage routine (all Unix systems I've test do), then use
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	253 that. Otherwise, use the ANSI C clock function. Note that on many
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	254 systems, the ANSI clock function wraps around after only 2147 seconds, or
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	255 about 36 minutes. BE CAREFUL: if you compare the run time of UMFPACK with
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	256 other sparse matrix packages, be sure to use the same timer. See
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	257 umfpack_tictoc.c for the timer used internally by UMFPACK. See also
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	258 umfpack_timer.c for the timer used in an earlier version of UMFPACK.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	259 That timer is still available as a user-callable routine, but it is no
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	260 longer used internally by UMFPACK.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	261 */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	262
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	263 /* Sun Solaris, SGI Irix, Linux, Compaq Alpha, and IBM RS 6000 all have */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	264 /* getrusage. It's in BSD unix, so perhaps all unix systems have it. */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	265 #if defined (UMF_SOL2) \|\| defined (UMF_SGI) \|\| defined (UMF_LINUX) \
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	266 \|\| defined (UMF_ALPHA) \|\| defined (UMF_AIX)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	267 #define GETRUSAGE
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	268 #endif
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	269
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	270
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	271 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	272 /* === BLAS ================================================================= */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	273 /* ========================================================================== */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	274
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	275 /*
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	276 The adventure begins. Figure out how to call the BLAS ...
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	277
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	278 This works, but it is incredibly ugly. The C-BLAS was supposed to solve
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	279 this problem, and make it easier to interface a C program to the BLAS.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	280 Unfortunately, the C-BLAS does not have a "long" integer (64 bit) version.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	281 Various vendors have done their own 64-bit BLAS. Sun has dgemm_64 routines
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	282 with "long" integers, SGI has a 64-bit dgemm in their scsl_blas_i8 library
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	283 with "long long" integers, and so on.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	284
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	285 Different vendors also have different ways of defining a complex number,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	286 some using struct's. That's a bad idea. See umf_version.h for the better
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	287 way to do it (the method that was also chosen for the complex C-BLAS,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	288 which is compatible and guaranteed to be portable with ANSI C).
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	289
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	290 To make matters worse, SGI's SCSL BLAS has a C-BLAS interface which
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	291 differs from the ATLAS C-BLAS interface (see immediately below);
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	292 although a more recent version of SGI's C-BLAS interface is correct
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	293 if SCSL_VOID_ARGS is defined.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	294 */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	295
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	296
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	297 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	298 /* Determine which BLAS to use. */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	299 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	300
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	301 #if defined (MATHWORKS)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	302 #define USE_MATLAB_BLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	303
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	304 #elif defined (NBLAS)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	305 #define USE_NO_BLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	306
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	307 #elif defined (MATLAB_MEX_FILE)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	308 #define USE_MATLAB_BLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	309
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	310 #elif defined (CBLAS)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	311 #define USE_C_BLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	312
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	313 #elif defined (UMF_SOL2) && !defined (NSUNPERF)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	314 #define USE_SUNPERF_BLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	315
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	316 #elif defined (UMF_SGI) && !defined (NSCSL)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	317 #define USE_SCSL_BLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	318
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	319 #else
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	320 #define USE_FORTRAN_BLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	321 #endif
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	322
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	323 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	324 /* int vs. long integer arguments */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	325 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	326
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	327 /*
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	328 Determine if the BLAS exists for the long integer version. It exists if
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	329 LONGBLAS is defined in the Makefile, or if using the BLAS from the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	330 Sun Performance Library, or SGI's SCSL Scientific Library.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	331 */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	332
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	333 #if defined (USE_SUNPERF_BLAS) \|\| defined (USE_SCSL_BLAS)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	334 #ifndef LONGBLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	335 #define LONGBLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	336 #endif
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	337 #endif
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	338
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	339 /* do not use the BLAS if Int's are long and LONGBLAS is not defined */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	340 #if defined (LONG_INTEGER) && !defined (LONGBLAS) && !defined (USE_NO_BLAS)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	341 #define USE_NO_BLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	342 #endif
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	343
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	344
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	345 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	346 /* Use (void ) arguments for the SGI /
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	347 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	348
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	349 #if defined (UMF_SGI)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	350 /*
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	351 Use (void *) pointers for complex types in SCSL.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	352 The ATLAS C-BLAS, and the SGI C-BLAS differ. The former uses (void *)
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	353 arguments, the latter uses SCSL_ZOMPLEX_T, which are either scsl_zomplex
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	354 or (void ). Using (void ) is simpler, and is selected by defining
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	355 SCSL_VOID_ARGS, below. The cc compiler doesn't complain, but gcc is
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	356 more picky, and generates a warning without this next statement.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	357 With gcc and the 07/09/98 version of SGI's cblas.h, spurious warnings
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	358 about complex BLAS arguments will be reported anyway. This is because this
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	359 older version of SGI's cblas.h does not make use of the SCSL_VOID_ARGS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	360 parameter, which is present in the 12/6/01 version of SGI's cblas.h. You
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	361 can safely ignore these warnings.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	362 */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	363 #define SCSL_VOID_ARGS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	364 #endif
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	365
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	366
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	367 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	368 /* The BLAS exists, construct appropriate macros */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	369 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	370
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	371 #if !defined (USE_NO_BLAS) /* { */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	372
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	373 /*
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	374 If the compile-time flag -DNBLAS is defined, then the BLAS are not used,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	375 portable vanilla C code is used instead, and the remainder of this file
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	376 is ignored.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	377
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	378 Using the BLAS is much faster, but how C calls the Fortran BLAS is
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	379 machine-dependent and thus can cause portability problems. Thus, use
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	380 -DNBLAS to ensure portability (at the expense of speed).
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	381
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	382 Preferences:
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	383
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	384 *** The best interface to use, regardless of the option you select
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	385 below, is the standard C-BLAS interface. Not all BLAS libraries
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	386 use this interface. The only problem with this interface is that
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	387 it does not extend to the LP64 model. The C-BLAS does not provide
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	388 for a 64-bit integer. In addition, SGI's older cblas.h can cause
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	389 spurious warnings when using the C-BLAS interface.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	390
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	391 1) often the most preferred (but see option (3)): use the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	392 optimized vendor-supplied library (such as the Sun Performance
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	393 Library, or IBM's ESSL). This is often the fastest, but might not
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	394 be portable and might not always be available. When compiling a
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	395 MATLAB mexFunction it might be difficult get the mex compiler
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	396 script to recognize the vendor- supplied BLAS. Note that the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	397 freely-available BLAS (option 3) can be faster than the vendor-
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	398 specific BLAS. You are encourage to try both option (1) and (3).
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	399
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	400 2) When compiling the UMFPACK mexFunction to use UMFPACK in MATLAB, use
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	401 the BLAS provided by The Mathworks, Inc. This assumes you are using
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	402 MATLAB V6 or higher, since the BLAS are not incorporated in V5 or
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	403 earlier versions. On my Sun workstation, the MATLAB BLAS gave
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	404 slightly worse performance than the Sun Perf. BLAS. The advantage
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	405 of using the MATLAB BLAS is that it's available on any computer that
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	406 has MATLAB V6 or higher. I have not tried using MATLAB BLAS outside
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	407 of a mexFunction in a stand-alone C code, but MATLAB (V6) allows for
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	408 this. This is well worth trying if you have MATLAB and don't want
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	409 to bother installing the ATLAS BLAS (option 3a, below). The only
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	410 glitch to this is that MATLAB does not provide a portable interface
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	411 to the BLAS (an underscore is required for some but not all
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	412 architectures). For Windows and MATLAB 6.0 or 6.1, you also need
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	413 to copy the libmwlapack.dll file into your MATLAB installation
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	414 directory; see the User Guide for details.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	415
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	416 In the current distribution, the only BLAS that the UMFPACK
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	417 mexFunction will use is the internal MATLAB BLAS. It's possible to
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	418 use other BLAS, but handling the porting of using the mex compiler
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	419 with different BLAS libraries is not trivial.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	420
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	421 As of MATLAB 6.5, the BLAS used internally in MATLAB is the ATLAS
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	422 BLAS.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	423
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	424 3) Use a freely-available high-performance BLAS library:
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	425
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	426 (a) The BLAS by Kazashige Goto and Robert van de Geijn, at
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	427 http://www.cs.utexas.edu/users/flame/goto. This BLAS increased
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	428 the performance of UMFPACK by almost 50% as compared to the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	429 ATLAS BLAS (v3.2).
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	430
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	431 (b) The ATLAS BLAS, available at http://www.netlib.org/atlas,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	432 by R. Clint Whaley, Antoine Petitet, and Jack Dongarra.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	433 This has a standard C interface, and thus the interface to it is
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	434 fully portable. Its performance rivals, and sometimes exceeds,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	435 the vendor-supplied BLAS on many computers.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	436
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	437 (b) The Fortran RISC BLAS by Michel Dayde', Iain Duff, Antoine
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	438 Petitet, and Abderrahim Qrichi Aniba, available via anonymous
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	439 ftp to ftp.enseeiht.fr in the pub/numerique/BLAS/RISC directory,
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	440 See M. J. Dayde' and I. S. Duff, "The RISC BLAS: A blocked
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	441 implementation of level 3 BLAS for RISC processors, ACM Trans.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	442 Math. Software, vol. 25, no. 3., Sept. 1999. This will give
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	443 you good performance, but with the same C-to-Fortran portability
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	444 problems as option (1).
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	445
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	446 4) Use UMFPACK's built-in vanilla C code by setting -DNBLAS at compile
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	447 time. The key advantage is portability, which is guaranteed if you
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	448 have an ANSI C compliant compiler. You also don't need to download
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	449 any other package - UMFPACK is stand-alone. No Fortran is used
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	450 anywhere in UMFPACK. UMFPACK will be much slower than when using
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	451 options (1) through (3), however.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	452
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	453 5) least preferred: use the standard Fortran implementation of the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	454 BLAS, also available at Netlib (http://www.netlib.org/blas). This
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	455 will be no faster than option (4), and not portable because of
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	456 C-to-Fortran calling conventions. Don't bother trying option (5).
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	457
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	458 The mechanics of how C calls the BLAS on various computers are as follows:
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	459
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	460 * C-BLAS (from the ATLAS library, for example):
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	461 The same interface is used on all computers.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	462
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	463 * Defaults for calling the Fortran BLAS:
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	464 add underscore, pass scalars by reference, use string arguments.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	465
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	466 * The Fortran BLAS on Sun Solaris (when compiling the MATLAB mexFunction
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	467 or when using the Fortran RISC BLAS), SGI IRIX, Linux, and Compaq
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	468 Alpha: use defaults.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	469
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	470 * Sun Solaris (when using the C-callable Sun Performance library):
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	471 no underscore, pass scalars by value, use character arguments.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	472
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	473 * The Fortran BLAS (ESSL Library) on the IBM RS 6000, and HP Unix:
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	474 no underscore, pass scalars by reference, use string arguments.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	475
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	476 * The Fortran BLAS on Windows:
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	477 no underscore, pass scalars by reference, use string arguments.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	478 If you compile the umfpack mexFunction using umfpack_make, and are
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	479 using the lcc compiler bundled with MATLAB, then you must first
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	480 copy the umfpack\lcc_lib\libmwlapack.lib file into the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	481 <matlab>\extern\lib\win32\lcc\ directory, where <matlab> is the
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	482 directory in which MATLAB is installed. Next, type mex -setup
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	483 at the MATLAB prompt, and ask MATLAB to select the lcc compiler.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	484 MATLAB has built-in BLAS, but it cannot be accessed by a program
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	485 compiled by lcc without first copying this file.
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	486 */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	487
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	488
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	489
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	490 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	491 #ifdef USE_C_BLAS /* { */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	492 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	493
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	494
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	495 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	496 /* use the C-BLAS (any computer) */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	497 /* -------------------------------------------------------------------------- */
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	498
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	499 /*
57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe] jwe parents: diff changeset	500 C-BLAS is the default interface, with the following exceptions. Solaris

5164

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

1 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

2 /* === umf_config.h ========================================================= */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

3 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

4

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

5 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

8 /* web: http://www.cise.ufl.edu/research/sparse/umfpack */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

9 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

10

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

11 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

12 This file controls the compile-time configuration of UMFPACK. Modify the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

13 Makefile, the architecture-dependent Make.* file, and this file if

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

14 necessary, to control these options. The following flags may be given

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

15 as options to your C compiler (as in "cc -DNBLAS", for example). These

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

16 flags are normally placed in your CONFIG string, defined in your Make.*.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

17

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

18 All of these options, except for the timer, are for accessing the BLAS.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

19

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

20 -DNBLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

21

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

22 BLAS mode. If -DNBLAS is set, then no BLAS will be used. Vanilla

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

23 C code will be used instead. This is portable, and easier to

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

24 install, but you won't get the best performance.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

25

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

26 If -DNBLAS is not set, then externally-available BLAS routines

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

27 (dgemm, dger, and dgemv or the equivalent C-BLAS routines) will be

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

28 used. This will give you the best performance, but perhaps at the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

29 expense of portability.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

30

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

31 The default is to use the BLAS, for both the C-callable libumfpack.a

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

32 library and the MATLAB mexFunction. If you have trouble installing

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

33 UMFPACK, set -DNBLAS (but then UMFPACK will be slow).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

34

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

35 -DCBLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

36

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

37 If -DCBLAS is set, then the C-BLAS interface to the BLAS is

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

38 used. If your vendor-supplied BLAS library does not have a C-BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

39 interface, you can obtain the ATLAS BLAS, available at

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

40 http://www.netlib.org/atlas.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

41

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

42 This flag is ignored if -DNBLAS is set.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

43

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

44 -DLP64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

45

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

46 This should be defined if you are compiling in the LP64 model

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

47 (32 bit int's, 64 bit long's, and 64 bit pointers). In Solaris,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

48 this is obtained with the flags -xtarget=ultra -xarch=v9 for

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

49 the cc compiler (for example).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

50

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

51 -DLONGBLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

52

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

53 If not defined, then the BLAS are not called in the long integer

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

54 version of UMFPACK (the umfpack_*l_* routines). The most common

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

55 definitions of the BLAS, unfortunately, use int arguments, and

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

56 are thus not suitable for use in the LP64 model. Only the Sun

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

57 Performance Library, as far as I can tell, has a version of the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

58 BLAS that allows long integer (64-bit) input arguments. This

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

59 flag is set automatically in Sun Solaris if you are using the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

60 Sun Performance BLAS. You can set it yourself, too, if your BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

61 routines can take long integer input arguments.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

62

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

63 -DNSUNPERF

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

65 Applies only to Sun Solaris. If -DNSUNPERF is set, then the Sun

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

66 Performance Library BLAS will not be used.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

67

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

68 The Sun Performance Library BLAS is used by default when compiling

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

69 the C-callable libumfpack.a library on Sun Solaris.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

70

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

71 This flag is ignored if -DNBLAS is set.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

72

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

73 -DNSCSL

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

74

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

75 Applies only to SGI IRIX. If -DSCSL is set, then the SGI SCSL

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

76 Scientific Library BLAS will not be used.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

77

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

78 The SGI SCSL Scientific Library BLAS is used by default when

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

79 compiling the C-callable libumfpack.a library on SGI IRIX.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

80

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

81 This flag is ignored if -DNBLAS is set.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

82

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

83 -DNPOSIX

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

84

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

85 If -DNPOSIX is set, then your Unix operating system is not POSIX-

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

86 compliant, and the POSIX routines sysconf ( ) and times ( )

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

87 routines are not used. These routines provide CPU time and

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

88 wallclock time information. If -DNPOSIX is set, then the ANSI

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

89 C clock ( ) routine is used. If -DNPOSIX is not set, then

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

90 sysconf ( ) and times ( ) are used in umfpack_tic and umfpack_toc.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

91 See umfpack_tictoc.c for more information.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

92 The default is to use the POSIX routines, except for Windows,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

93 which is not POSIX-compliant.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

94

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

95 -DGETRUSAGE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

96

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

97 If -DGETRUSAGE is set, then your system's getrusage ( ) routine

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

98 will be used for getting the process CPU time. Otherwise the ANSI

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

99 C clock ( ) routine will be used. The default is to use getrusage

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

100 ( ) on Unix systems, and to use clock on all other architectures.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

101

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

102 -DNO_TIMER

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

103

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

104 If -DNO_TIMER is set, then no timing routines are used at all.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

105

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

106 -DNUTIL

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

107

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

108 If -DNUTIL is set, then the internal MATLAB utMalloc, utFree, and

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

109 utRealloc routines are not used in the UMFPACK mexFunction. The

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

110 regular mxMalloc, mxFree, and mxRealloc routines are used instead.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

111 These routines are not documented, but are available for use. For

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

112 Windows, -DNUTIL is defined below, because access to the ut*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

113 routines is not available by default.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

114

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

115 -DNRECIPROCAL

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

116

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

117 This option controls a tradeoff between speed and accuracy. Using

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

118 -DNRECIPROCAL can lead to more accurate results, but with perhaps

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

119 some cost in performance, particularly if floating-point division

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

120 is much more costly than floating-point multiplication.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

121

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

122 This option determines the method used to scale the pivot column.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

123 If set, or if the absolute value of the pivot is < 1e-12 (or is a

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

124 NaN), then the pivot column is divided by the pivot value.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

125 Otherwise, the reciprocal of the pivot value is computed, and the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

126 pivot column is multiplied by (1/pivot). Multiplying by the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

127 reciprocal can be slightly less accurate than dividing by the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

128 pivot, but it is often faster. See umf_scale.c.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

129

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

130 This has a small effect on the performance of UMFPACK, at least on

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

131 a Pentium 4M. It may have a larger effect on other architectures

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

132 where floating-point division is much more costly than floating-

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

133 point multiplication. The RS 6000 is one such example.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

134

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

135 By default, the method chosen is to multiply by the reciprocal

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

136 (sacrificing accuracy for speed), except when compiling UMFPACK

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

137 as a built-in routine in MATLAB, or when gcc is being used.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

138

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

139 When MATHWORKS is defined, -DNRECIPROCAL is forced on, and the pivot

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

140 column is divided by the pivot value. The only way of using the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

141 other method in this case is to edit this file.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

142

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

143 If -DNRECIPROCAL is enabled, then the row scaling factors are always

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

144 applied by dividing each row by the scale factor, rather than

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

145 multiplying by the reciprocal. If -DNRECIPROCAL is not enabled

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

146 (the default case), then the scale factors are normally applied by

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

147 multiplying by the reciprocal. If, however, the smallest scale

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

148 factor is tiny, then the scale factors are applied via division.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

149

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

150 -DNO_DIVIDE_BY_ZERO

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

151

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

152 If the pivot is zero, and this flag is set, then no divide-by-zero

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

153 occurs.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

154

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

155 You should normally not set these flags yourself:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

156

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

157 -DBLAS_BY_VALUE if scalars are passed by value, not reference

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

158 -DBLAS_NO_UNDERSCORE if no underscore should be appended

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

159 -DBLAS_CHAR_ARG if BLAS options are single char's, not strings

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

160

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

161 The BLAS options are normally set automatically. If your

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

162 architecture cannot be determined (see UMFPACK_ARCHITECTURE, below)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

163 then you may need to set these flags yourself.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

164

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

165 The following options are controlled by amd_internal.h:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

166

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

167 -DMATLAB_MEX_FILE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

168

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

169 This flag is turned on when compiling the umfpack mexFunction for

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

170 use in MATLAB. When compiling the MATLAB mexFunction, the MATLAB

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

171 BLAS are used (unless -DNBLAS is set). The -DCBLAS, -DNSCSL, and

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

172 -DNSUNPERF flags are all ignored. The -DNRECIPROCAL flag is

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

173 forced on. Otherwise, [L,U,P,Q,R] = umfpack (A) would return

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

174 either L*U = P*(R\A)*Q or L*U = P*R*A*Q. Rather than returning a

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

175 flag stating how the scale factors R are to be applied, the umfpack

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

176 mexFunction always takes the more accurate route and returns

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

177 L*U = P*(R\A)*Q.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

178

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

179 -DMATHWORKS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

180

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

181 This flag is turned on when compiling umfpack as a built-in routine

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

182 in MATLAB. The MATLAB BLAS are used for all architectures (-DNBLAS,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

183 -DCBLAS, -DNSCSL, and -DNSUNPERF flags are all ignored). Internal

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

184 routines utMalloc, utFree, utRealloc, utPrintf, utDivideComplex,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

185 and utFdlibm_hypot are used, and the "util.h" file is included.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

186 This avoids the problem discussed in the User Guide regarding memory

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

187 allocation in MATLAB. utMalloc returns NULL on failure, instead of

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

188 terminating the mexFunction (which is what mxMalloc does). However,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

189 the ut* routines are not documented by The MathWorks, Inc., so I

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

190 cannot guarantee that you will always be able to use them.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

191 The -DNRECIPROCAL flag is turned on.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

192

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

193 -DNDEBUG

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

194

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

195 Debugging mode (if NDEBUG is not defined). The default, of course,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

196 is no debugging. Turning on debugging takes some work (see below).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

197 If you do not edit this file, then debugging is turned off anyway,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

198 regardless of whether or not -DNDEBUG is specified in your compiler

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

199 options.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

200 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

201

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

202 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

203 /* === AMD configuration ==================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

204 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

205

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

206 /* NDEBUG, PRINTF defined in amd_internal.h */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

207

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

208 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

209 /* === reciprocal option ==================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

210 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

211

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

212 /* Force the definition NRECIPROCAL when MATHWORKS or MATLAB_MEX_FILE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

213 * are defined. Do not multiply by the reciprocal in those cases. */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

214

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

215 #ifndef NRECIPROCAL

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

216 #if defined (MATHWORKS) || defined (MATLAB_MEX_FILE)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

217 #define NRECIPROCAL

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

218 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

219 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

220

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

221 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

222 /* === Microsoft Windows configuration ====================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

223 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

224

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

225 #ifdef UMF_WINDOWS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

226 /* Windows can't access the ut* routines, and it isn't Unix. */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

227 #define NUTIL

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

228 #define NPOSIX

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

229 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

230

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

231 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

232 /* === 0-based or 1-based printing ========================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

233 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

234

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

235 #if defined (MATLAB_MEX_FILE) && defined (NDEBUG)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

236 /* In MATLAB, matrices are 1-based to the user, but 0-based internally. */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

237 /* One is added to all row and column indices when printing matrices */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

238 /* for the MATLAB user. The +1 shift is turned off when debugging. */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

239 #define INDEX(i) ((i)+1)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

240 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

241 /* In ANSI C, matrices are 0-based and indices are reported as such. */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

242 /* This mode is also used for debug mode, and if MATHWORKS is defined rather */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

243 /* than MATLAB_MEX_FILE. */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

244 #define INDEX(i) (i)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

245 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

246

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

247 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

248 /* === Timer ================================================================ */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

249 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

250

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

251 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

252 If you have the getrusage routine (all Unix systems I've test do), then use

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

253 that. Otherwise, use the ANSI C clock function. Note that on many

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

254 systems, the ANSI clock function wraps around after only 2147 seconds, or

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

255 about 36 minutes. BE CAREFUL: if you compare the run time of UMFPACK with

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

256 other sparse matrix packages, be sure to use the same timer. See

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

257 umfpack_tictoc.c for the timer used internally by UMFPACK. See also

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

258 umfpack_timer.c for the timer used in an earlier version of UMFPACK.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

259 That timer is still available as a user-callable routine, but it is no

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

260 longer used internally by UMFPACK.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

261 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

262

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

263 /* Sun Solaris, SGI Irix, Linux, Compaq Alpha, and IBM RS 6000 all have */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

264 /* getrusage. It's in BSD unix, so perhaps all unix systems have it. */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

265 #if defined (UMF_SOL2) || defined (UMF_SGI) || defined (UMF_LINUX) \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

266 || defined (UMF_ALPHA) || defined (UMF_AIX)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

267 #define GETRUSAGE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

268 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

269

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

270

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

271 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

272 /* === BLAS ================================================================= */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

273 /* ========================================================================== */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

274

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

275 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

276 The adventure begins. Figure out how to call the BLAS ...

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

277

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

278 This works, but it is incredibly ugly. The C-BLAS was supposed to solve

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

279 this problem, and make it easier to interface a C program to the BLAS.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

280 Unfortunately, the C-BLAS does not have a "long" integer (64 bit) version.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

281 Various vendors have done their own 64-bit BLAS. Sun has dgemm_64 routines

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

282 with "long" integers, SGI has a 64-bit dgemm in their scsl_blas_i8 library

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

283 with "long long" integers, and so on.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

284

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

285 Different vendors also have different ways of defining a complex number,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

286 some using struct's. That's a bad idea. See umf_version.h for the better

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

287 way to do it (the method that was also chosen for the complex C-BLAS,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

288 which is compatible and guaranteed to be portable with ANSI C).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

289

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

290 To make matters worse, SGI's SCSL BLAS has a C-BLAS interface which

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

291 differs from the ATLAS C-BLAS interface (see immediately below);

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

292 although a more recent version of SGI's C-BLAS interface is correct

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

293 if SCSL_VOID_ARGS is defined.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

294 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

295

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

296

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

297 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

298 /* Determine which BLAS to use. */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

299 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

300

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

301 #if defined (MATHWORKS)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

302 #define USE_MATLAB_BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

303

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

304 #elif defined (NBLAS)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

305 #define USE_NO_BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

306

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

307 #elif defined (MATLAB_MEX_FILE)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

308 #define USE_MATLAB_BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

309

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

310 #elif defined (CBLAS)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

311 #define USE_C_BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

312

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

313 #elif defined (UMF_SOL2) && !defined (NSUNPERF)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

314 #define USE_SUNPERF_BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

315

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

316 #elif defined (UMF_SGI) && !defined (NSCSL)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

317 #define USE_SCSL_BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

318

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

319 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

320 #define USE_FORTRAN_BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

321 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

322

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

323 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

324 /* int vs. long integer arguments */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

325 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

326

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

327 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

328 Determine if the BLAS exists for the long integer version. It exists if

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

329 LONGBLAS is defined in the Makefile, or if using the BLAS from the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

330 Sun Performance Library, or SGI's SCSL Scientific Library.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

331 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

332

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

333 #if defined (USE_SUNPERF_BLAS) || defined (USE_SCSL_BLAS)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

334 #ifndef LONGBLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

335 #define LONGBLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

336 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

337 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

338

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

339 /* do not use the BLAS if Int's are long and LONGBLAS is not defined */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

340 #if defined (LONG_INTEGER) && !defined (LONGBLAS) && !defined (USE_NO_BLAS)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

341 #define USE_NO_BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

342 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

343

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

344

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

345 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

346 /* Use (void *) arguments for the SGI */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

347 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

348

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

349 #if defined (UMF_SGI)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

350 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

351 Use (void *) pointers for complex types in SCSL.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

352 The ATLAS C-BLAS, and the SGI C-BLAS differ. The former uses (void *)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

353 arguments, the latter uses SCSL_ZOMPLEX_T, which are either scsl_zomplex

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

354 or (void *). Using (void *) is simpler, and is selected by defining

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

355 SCSL_VOID_ARGS, below. The cc compiler doesn't complain, but gcc is

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

356 more picky, and generates a warning without this next statement.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

357 With gcc and the 07/09/98 version of SGI's cblas.h, spurious warnings

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

358 about complex BLAS arguments will be reported anyway. This is because this

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

359 older version of SGI's cblas.h does not make use of the SCSL_VOID_ARGS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

360 parameter, which is present in the 12/6/01 version of SGI's cblas.h. You

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

361 can safely ignore these warnings.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

362 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

363 #define SCSL_VOID_ARGS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

364 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

365

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

366

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

367 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

368 /* The BLAS exists, construct appropriate macros */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

369 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

370

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

371 #if !defined (USE_NO_BLAS) /* { */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

372

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

373 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

374 If the compile-time flag -DNBLAS is defined, then the BLAS are not used,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

375 portable vanilla C code is used instead, and the remainder of this file

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

376 is ignored.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

377

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

378 Using the BLAS is much faster, but how C calls the Fortran BLAS is

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

379 machine-dependent and thus can cause portability problems. Thus, use

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

380 -DNBLAS to ensure portability (at the expense of speed).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

381

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

382 Preferences:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

383

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

384 *** The best interface to use, regardless of the option you select

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

385 below, is the standard C-BLAS interface. Not all BLAS libraries

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

386 use this interface. The only problem with this interface is that

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

387 it does not extend to the LP64 model. The C-BLAS does not provide

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

388 for a 64-bit integer. In addition, SGI's older cblas.h can cause

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

389 spurious warnings when using the C-BLAS interface.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

390

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

391 1) often the most preferred (but see option (3)): use the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

392 optimized vendor-supplied library (such as the Sun Performance

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

393 Library, or IBM's ESSL). This is often the fastest, but might not

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

394 be portable and might not always be available. When compiling a

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

395 MATLAB mexFunction it might be difficult get the mex compiler

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

396 script to recognize the vendor- supplied BLAS. Note that the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

397 freely-available BLAS (option 3) can be faster than the vendor-

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

398 specific BLAS. You are encourage to try both option (1) and (3).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

399

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

400 2) When compiling the UMFPACK mexFunction to use UMFPACK in MATLAB, use

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

401 the BLAS provided by The Mathworks, Inc. This assumes you are using

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

402 MATLAB V6 or higher, since the BLAS are not incorporated in V5 or

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

403 earlier versions. On my Sun workstation, the MATLAB BLAS gave

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

404 slightly worse performance than the Sun Perf. BLAS. The advantage

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

405 of using the MATLAB BLAS is that it's available on any computer that

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

406 has MATLAB V6 or higher. I have not tried using MATLAB BLAS outside

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

407 of a mexFunction in a stand-alone C code, but MATLAB (V6) allows for

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

408 this. This is well worth trying if you have MATLAB and don't want

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

409 to bother installing the ATLAS BLAS (option 3a, below). The only

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

410 glitch to this is that MATLAB does not provide a portable interface

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

411 to the BLAS (an underscore is required for some but not all

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

412 architectures). For Windows and MATLAB 6.0 or 6.1, you also need

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

413 to copy the libmwlapack.dll file into your MATLAB installation

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

414 directory; see the User Guide for details.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

415

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

416 In the current distribution, the only BLAS that the UMFPACK

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

417 mexFunction will use is the internal MATLAB BLAS. It's possible to

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

418 use other BLAS, but handling the porting of using the mex compiler

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

419 with different BLAS libraries is not trivial.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

420

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

421 As of MATLAB 6.5, the BLAS used internally in MATLAB is the ATLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

422 BLAS.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

423

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

424 3) Use a freely-available high-performance BLAS library:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

425

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

426 (a) The BLAS by Kazashige Goto and Robert van de Geijn, at

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

427 http://www.cs.utexas.edu/users/flame/goto. This BLAS increased

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

428 the performance of UMFPACK by almost 50% as compared to the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

429 ATLAS BLAS (v3.2).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

430

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

431 (b) The ATLAS BLAS, available at http://www.netlib.org/atlas,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

432 by R. Clint Whaley, Antoine Petitet, and Jack Dongarra.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

433 This has a standard C interface, and thus the interface to it is

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

434 fully portable. Its performance rivals, and sometimes exceeds,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

435 the vendor-supplied BLAS on many computers.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

436

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

437 (b) The Fortran RISC BLAS by Michel Dayde', Iain Duff, Antoine

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

438 Petitet, and Abderrahim Qrichi Aniba, available via anonymous

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

439 ftp to ftp.enseeiht.fr in the pub/numerique/BLAS/RISC directory,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

440 See M. J. Dayde' and I. S. Duff, "The RISC BLAS: A blocked

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

441 implementation of level 3 BLAS for RISC processors, ACM Trans.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

442 Math. Software, vol. 25, no. 3., Sept. 1999. This will give

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

443 you good performance, but with the same C-to-Fortran portability

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

444 problems as option (1).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

445

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

446 4) Use UMFPACK's built-in vanilla C code by setting -DNBLAS at compile

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

447 time. The key advantage is portability, which is guaranteed if you

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

448 have an ANSI C compliant compiler. You also don't need to download

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

449 any other package - UMFPACK is stand-alone. No Fortran is used

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

450 anywhere in UMFPACK. UMFPACK will be much slower than when using

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

451 options (1) through (3), however.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

452

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

453 5) least preferred: use the standard Fortran implementation of the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

454 BLAS, also available at Netlib (http://www.netlib.org/blas). This

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

455 will be no faster than option (4), and not portable because of

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

456 C-to-Fortran calling conventions. Don't bother trying option (5).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

457

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

458 The mechanics of how C calls the BLAS on various computers are as follows:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

459

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

460 * C-BLAS (from the ATLAS library, for example):

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

461 The same interface is used on all computers.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

462

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

463 * Defaults for calling the Fortran BLAS:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

464 add underscore, pass scalars by reference, use string arguments.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

465

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

466 * The Fortran BLAS on Sun Solaris (when compiling the MATLAB mexFunction

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

467 or when using the Fortran RISC BLAS), SGI IRIX, Linux, and Compaq

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

468 Alpha: use defaults.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

469

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

470 * Sun Solaris (when using the C-callable Sun Performance library):

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

471 no underscore, pass scalars by value, use character arguments.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

472

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

473 * The Fortran BLAS (ESSL Library) on the IBM RS 6000, and HP Unix:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

474 no underscore, pass scalars by reference, use string arguments.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

475

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

476 * The Fortran BLAS on Windows:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

477 no underscore, pass scalars by reference, use string arguments.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

478 If you compile the umfpack mexFunction using umfpack_make, and are

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

479 using the lcc compiler bundled with MATLAB, then you must first

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

480 copy the umfpack\lcc_lib\libmwlapack.lib file into the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

481 <matlab>\extern\lib\win32\lcc\ directory, where <matlab> is the

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

482 directory in which MATLAB is installed. Next, type mex -setup

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

483 at the MATLAB prompt, and ask MATLAB to select the lcc compiler.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

484 MATLAB has built-in BLAS, but it cannot be accessed by a program

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

485 compiled by lcc without first copying this file.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

486 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

487

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

488

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

489

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

490 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

491 #ifdef USE_C_BLAS /* { */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

492 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

493

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

494

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

495 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

496 /* use the C-BLAS (any computer) */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

497 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

498

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

499 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

500 C-BLAS is the default interface, with the following exceptions. Solaris

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

501 uses the Sun Performance BLAS for libumfpack.a (the C-callable library).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

502 SGI IRIX uses the SCSL BLAS for libumfpack.a. All architectures use

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

503 MATLAB's internal BLAS for the mexFunction on any architecture. These

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

504 options are set in the Make.* files. The Make.generic file uses no BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

505 at all.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

506

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

507 If you use the ATLAS C-BLAS, then be sure to set the -I flag to

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

508 -I/path/ATLAS/include, where /path/ATLAS is the ATLAS installation

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

509 directory. See Make.solaris for an example. You do not need to do this

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

510 for the SGI, which has a /usr/include/cblas.h.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

511 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

512

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

513 #include "cblas.h"

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

514

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

515 #ifdef COMPLEX

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

516 #define BLAS_GEMM_ROUTINE cblas_zgemm

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

517 #define BLAS_TRSM_ROUTINE cblas_ztrsm

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

518 #define BLAS_TRSV_ROUTINE cblas_ztrsv

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

519 #define BLAS_GEMV_ROUTINE cblas_zgemv

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

520 #define BLAS_GER_ROUTINE cblas_zgeru

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

521 #define BLAS_SCAL_ROUTINE cblas_zscal

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

522 #define BLAS_COPY_ROUTINE cblas_zcopy

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

523 #define BLAS_DECLARE_SCALAR(x) double x [2]

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

524 #define BLAS_ASSIGN(x,xr,xi) { x [0] = xr ; x [1] = xi ; }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

525 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

526 #define BLAS_GEMM_ROUTINE cblas_dgemm

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

527 #define BLAS_TRSM_ROUTINE cblas_dtrsm

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

528 #define BLAS_TRSV_ROUTINE cblas_dtrsv

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

529 #define BLAS_GEMV_ROUTINE cblas_dgemv

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

530 #define BLAS_GER_ROUTINE cblas_dger

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

531 #define BLAS_SCAL_ROUTINE cblas_dscal

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

532 #define BLAS_COPY_ROUTINE cblas_dcopy

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

533 #define BLAS_DECLARE_SCALAR(x) double x

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

534 #define BLAS_ASSIGN(x,xr,xi) { x = xr ; }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

535 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

536

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

537 #define BLAS_LOWER CblasLower

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

538 #define BLAS_UNIT_DIAGONAL CblasUnit

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

539 #define BLAS_RIGHT CblasRight

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

540 #define BLAS_NO_TRANSPOSE CblasNoTrans

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

541 #define BLAS_TRANSPOSE CblasTrans

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

542 #define BLAS_COLUMN_MAJOR_ORDER CblasColMajor,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

543 #define BLAS_SCALAR(x) x

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

544 #define BLAS_INT_SCALAR(n) n

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

545 #define BLAS_ARRAY(a) a

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

546

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

547

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

548

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

549 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

550 #else /* } USE_C_BLAS { */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

551 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

552

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

553 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

554 /* use Fortran (or other architecture-specific) BLAS */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

555 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

556

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

557 /* No such argument when not using the C-BLAS */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

558 #define BLAS_COLUMN_MAJOR_ORDER

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

559

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

560 /* Determine which architecture we're on and set options accordingly. */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

561 /* The default, if nothing is defined is to add an underscore, */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

562 /* pass scalars by reference, and use string arguments. */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

563

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

564 /* ---------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

565 /* Sun Performance BLAS */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

566 /* ---------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

567

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

568 #ifdef USE_SUNPERF_BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

569 #ifdef _SUNPERF_H

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

570 /* <sunperf.h> has been included somehow anyway, outside of umf_config.h */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

571 #error "sunperf.h must NOT be #include'd. See umf_config.h for details."

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

572 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

573 #define BLAS_BY_VALUE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

574 #define BLAS_NO_UNDERSCORE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

575 #define BLAS_CHAR_ARG

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

576 #endif /* USE_SUNPERF_BLAS */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

577

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

578 /* ---------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

579 /* SGI SCSL BLAS */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

580 /* ---------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

581

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

582 #ifdef USE_SCSL_BLAS

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

583 #if defined (LP64)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

584 #include <scsl_blas_i8.h>

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

585 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

586 #include <scsl_blas.h>

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

587 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

588 #define BLAS_BY_VALUE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

589 #define BLAS_NO_UNDERSCORE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

590 #endif /* USE_SCSL_BLAS */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

591

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

592 /* ---------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

593 /* IBM AIX, Windows, and HP Fortran BLAS */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

594 /* ---------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

595

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

596 #if defined (UMF_AIX) || defined (UMF_WINDOWS) || defined (UMF_HP)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

597 #define BLAS_NO_UNDERSCORE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

598 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

599

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

600

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

601 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

602 /* BLAS names */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

603 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

604

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

605 #if defined (LP64) && defined (USE_SUNPERF_BLAS) && defined (LONG_INTEGER)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

606

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

607 /* 64-bit sunperf BLAS, for Sun Solaris only */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

608 #ifdef COMPLEX

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

609 #define BLAS_GEMM_ROUTINE zgemm_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

610 #define BLAS_TRSM_ROUTINE ztrsm_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

611 #define BLAS_TRSV_ROUTINE ztrsv_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

612 #define BLAS_GEMV_ROUTINE zgemv_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

613 #define BLAS_GER_ROUTINE zgeru_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

614 #define BLAS_SCAL_ROUTINE zscal_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

615 #define BLAS_COPY_ROUTINE zcopy_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

616 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

617 #define BLAS_GEMM_ROUTINE dgemm_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

618 #define BLAS_TRSM_ROUTINE dtrsm_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

619 #define BLAS_TRSV_ROUTINE dtrsv_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

620 #define BLAS_GEMV_ROUTINE dgemv_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

621 #define BLAS_GER_ROUTINE dger_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

622 #define BLAS_SCAL_ROUTINE dscal_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

623 #define BLAS_COPY_ROUTINE dcopy_64

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

624 #endif /* COMPLEX */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

625

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

626 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

627

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

628 #ifdef COMPLEX

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

629

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

630 /* naming convention (use underscore, or not) */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

631 #ifdef BLAS_NO_UNDERSCORE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

632 #define BLAS_GEMM_ROUTINE zgemm

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

633 #define BLAS_TRSM_ROUTINE ztrsm

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

634 #define BLAS_TRSV_ROUTINE ztrsv

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

635 #define BLAS_GEMV_ROUTINE zgemv

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

636 #define BLAS_GER_ROUTINE zgeru

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

637 #define BLAS_SCAL_ROUTINE zscal

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

638 #define BLAS_COPY_ROUTINE zcopy

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

639 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

640 /* default: add underscore */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

641 #define BLAS_GEMM_ROUTINE zgemm_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

642 #define BLAS_TRSM_ROUTINE ztrsm_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

643 #define BLAS_TRSV_ROUTINE ztrsv_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

644 #define BLAS_GEMV_ROUTINE zgemv_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

645 #define BLAS_GER_ROUTINE zgeru_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

646 #define BLAS_SCAL_ROUTINE zscal_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

647 #define BLAS_COPY_ROUTINE zcopy_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

648 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

649

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

650 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

651

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

652 /* naming convention (use underscore, or not) */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

653 #ifdef BLAS_NO_UNDERSCORE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

654 #define BLAS_GEMM_ROUTINE dgemm

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

655 #define BLAS_TRSM_ROUTINE dtrsm

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

656 #define BLAS_TRSV_ROUTINE dtrsv

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

657 #define BLAS_GEMV_ROUTINE dgemv

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

658 #define BLAS_GER_ROUTINE dger

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

659 #define BLAS_SCAL_ROUTINE dscal

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

660 #define BLAS_COPY_ROUTINE dcopy

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

661 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

662 /* default: add underscore */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

663 #define BLAS_GEMM_ROUTINE dgemm_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

664 #define BLAS_TRSM_ROUTINE dtrsm_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

665 #define BLAS_TRSV_ROUTINE dtrsv_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

666 #define BLAS_GEMV_ROUTINE dgemv_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

667 #define BLAS_GER_ROUTINE dger_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

668 #define BLAS_SCAL_ROUTINE dscal_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

669 #define BLAS_COPY_ROUTINE dcopy_

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

670 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

671

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

672 #endif /* COMPLEX */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

673

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

674 #endif /* LP64 && USE_SUNPERF_BLAS */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

675

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

676

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

677 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

678 /* BLAS real or complex floating-point scalars */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

679 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

680

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

681 #ifdef COMPLEX

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

682

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

683 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

684 The SunPerf BLAS expects to see a doublecomplex scalar, but it

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

685 also will accept an array of size 2. See the manual, normally at

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

686 file:///opt/SUNWspro/WS6U1/lib/locale/C/html/manuals/perflib/user_guide

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

687 /plug_using_perflib.html . This manual is inconsistent with the man pages

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

688 for zgemm, zgemv, and zgeru and also inconsistent with the <sunperf.h>

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

689 include file. Use this instead, for SunPerf (only works if you do NOT

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

690 include sunperf.h). Fortunately, this file (umf_config.h) is not included

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

691 in any user code that calls UMFPACK. Thus, the caller may include

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

692 sunperf.h in his or her own code, and that is safely ignored here.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

693 SGI's SCSL BLAS has yet a different kind of struct, but we can use a

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

694 double array of size 2 instead (since SCSL_VOID_ARGS is defined).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

695 Most BLAS expect complex scalars as pointers to double arrays of size 2.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

696 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

697

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

698 #define BLAS_DECLARE_SCALAR(x) double x [2]

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

699 #define BLAS_ASSIGN(x,xr,xi) { x [0] = xr ; x [1] = xi ; }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

700 #define BLAS_SCALAR(x) x

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

701

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

702 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

703

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

704 #define BLAS_DECLARE_SCALAR(x) double x

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

705 #define BLAS_ASSIGN(x,xr,xi) { x = xr ; }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

706 #ifdef BLAS_BY_VALUE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

707 #define BLAS_SCALAR(x) x

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

708 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

709 #define BLAS_SCALAR(x) &(x)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

710 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

711

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

712 #endif /* COMPLEX */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

713

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

714

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

715 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

716 /* BLAS integer scalars */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

717 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

718

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

719 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

720 Fortran requires integers to be passed by reference.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

721 The SCSL BLAS requires long long arguments in LP64 mode.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

722 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

723

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

724 #if defined (USE_SCSL_BLAS) && defined (LP64)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

725 #define BLAS_INT_SCALAR(n) ((long long) n)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

726 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

727 #ifdef BLAS_BY_VALUE

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

728 #define BLAS_INT_SCALAR(n) n

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

729 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

730 #define BLAS_INT_SCALAR(n) &(n)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

731 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

732 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

733

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

734

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

735 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

736 /* BLAS strings */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

737 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

738

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

739 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

740 The Sun Performance BLAS wants a character instead of a string.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

741 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

742

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

743 #ifdef BLAS_CHAR_ARG

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

744 #define BLAS_NO_TRANSPOSE 'N'

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

745 #define BLAS_TRANSPOSE 'T'

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

746 #define BLAS_LEFT 'L'

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

747 #define BLAS_RIGHT 'R'

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

748 #define BLAS_LOWER 'L'

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

749 #define BLAS_UNIT_DIAGONAL 'U'

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

750 #else

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

751 #define BLAS_NO_TRANSPOSE "N"

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

752 #define BLAS_TRANSPOSE "T"

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

753 #define BLAS_LEFT "L"

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

754 #define BLAS_RIGHT "R"

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

755 #define BLAS_LOWER "L"

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

756 #define BLAS_UNIT_DIAGONAL "U"

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

757 #endif

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

758

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

759

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

760 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

761 /* BLAS arrays */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

762 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

763

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

764 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

765 The complex SunPerf BLAS expects to see a doublecomplex array of size s.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

766 This is broken (see above, regarding complex scalars in sunperf.h).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

767 For SunPerf BLAS, just pass a pointer to the array, and ignore sunperf.h.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

768 With sunperf.h, you would need:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

769

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

770 #define BLAS_ARRAY(a) ((doublecomplex *)(a))

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

771

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

772 SGI's SCSL BLAS has yet a different kind of struct, but we can use a

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

773 double array of size 2 instead (since SCSL_VOID_ARGS is defined).

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

774

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

775 The real versions all use just a (double *) pointer.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

776

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

777 In all cases, no typecast is required. This will break if <sunperf.h> is

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

778 included.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

779

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

780 If you have read this far, I hope you see now why (void *) a much better

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

781 choice for complex BLAS prototypes, and why double x [2] is better than

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

782 an architecture dependent struct { double real ; double imag ; }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

783 type definition.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

784

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

785 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

786

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

787 #define BLAS_ARRAY(a) (a)

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

788

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

789

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

790 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

791 #endif /* USE_C_BLAS } */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

792 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

793

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

794

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

795

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

796

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

797

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

798 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

799 /* BLAS macros, for all interfaces */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

800 /* -------------------------------------------------------------------------- */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

801

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

802 /*

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

803 All architecture dependent issues have now been taken into consideration,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

804 and folded into the macros BLAS_DECLARE_SCALAR, BLAS_ASSIGN, BLAS_*_ROUTINE,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

805 BLAS_COLUMN_MAJOR_ORDER, BLAS_NO_TRANSPOSE, BLAS_TRANSPOSE, BLAS_SCALAR,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

806 BLAS_INT_SCALAR, BLAS_ARRAY, and Int.

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

807

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

808 You will note that there is not a *** single *** name, declaration, or

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

809 argument to the BLAS which is not somehow different in one or more versions

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

810 of the BLAS!

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

811 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

812

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

813

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

814 /* C = C - A*B', where:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

815 * A is m-by-k with leading dimension ldac

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

816 * B is k-by-n with leading dimension ldb

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

817 * C is m-by-n with leading dimension ldac */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

818 #define BLAS_GEMM(m,n,k,A,B,ldb,C,ldac) \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

819 { \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

820 BLAS_DECLARE_SCALAR (alpha) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

821 BLAS_DECLARE_SCALAR (beta) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

822 BLAS_ASSIGN (alpha, -1.0, 0.0) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

823 BLAS_ASSIGN (beta, 1.0, 0.0) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

824 (void) BLAS_GEMM_ROUTINE (BLAS_COLUMN_MAJOR_ORDER \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

825 BLAS_NO_TRANSPOSE, BLAS_TRANSPOSE, \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

826 BLAS_INT_SCALAR (m), BLAS_INT_SCALAR (n), BLAS_INT_SCALAR (k), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

827 BLAS_SCALAR (alpha), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

828 BLAS_ARRAY (A), BLAS_INT_SCALAR (ldac), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

829 BLAS_ARRAY (B), BLAS_INT_SCALAR (ldb), BLAS_SCALAR (beta), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

830 BLAS_ARRAY (C), BLAS_INT_SCALAR (ldac)) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

831 }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

832

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

833 /* A = A - x*y', where:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

834 * A is m-by-n with leading dimension d

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

835 x is a column vector with stride 1

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

836 y is a column vector with stride 1 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

837 #define BLAS_GER(m,n,x,y,A,d) \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

838 { \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

839 Int one = 1 ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

840 BLAS_DECLARE_SCALAR (alpha) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

841 BLAS_ASSIGN (alpha, -1.0, 0.0) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

842 (void) BLAS_GER_ROUTINE (BLAS_COLUMN_MAJOR_ORDER \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

843 BLAS_INT_SCALAR (m), BLAS_INT_SCALAR (n), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

844 BLAS_SCALAR (alpha), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

845 BLAS_ARRAY (x), BLAS_INT_SCALAR (one), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

846 BLAS_ARRAY (y), BLAS_INT_SCALAR (one), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

847 BLAS_ARRAY (A), BLAS_INT_SCALAR (d)) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

848 }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

849

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

850 /* y = y - A*x, where A is m-by-n with leading dimension d,

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

851 x is a column vector with stride 1

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

852 y is a column vector with stride 1 */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

853 #define BLAS_GEMV(m,n,A,x,y,d) \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

854 { \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

855 Int one = 1 ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

856 BLAS_DECLARE_SCALAR (alpha) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

857 BLAS_DECLARE_SCALAR (beta) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

858 BLAS_ASSIGN (alpha, -1.0, 0.0) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

859 BLAS_ASSIGN (beta, 1.0, 0.0) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

860 (void) BLAS_GEMV_ROUTINE (BLAS_COLUMN_MAJOR_ORDER \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

861 BLAS_NO_TRANSPOSE, \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

862 BLAS_INT_SCALAR (m), BLAS_INT_SCALAR (n), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

863 BLAS_SCALAR (alpha), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

864 BLAS_ARRAY (A), BLAS_INT_SCALAR (d), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

865 BLAS_ARRAY (x), BLAS_INT_SCALAR (one), BLAS_SCALAR (beta), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

866 BLAS_ARRAY (y), BLAS_INT_SCALAR (one)) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

867 }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

868

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

869

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

870 /* solve Lx=b, where:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

871 * B is a column vector (m-by-1) with leading dimension d

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

872 * A is m-by-m with leading dimension d */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

873 #define BLAS_TRSV(m,A,b,d) \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

874 { \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

875 Int one = 1 ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

876 (void) BLAS_TRSV_ROUTINE (BLAS_COLUMN_MAJOR_ORDER \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

877 BLAS_LOWER, BLAS_NO_TRANSPOSE, BLAS_UNIT_DIAGONAL, \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

878 BLAS_INT_SCALAR (m), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

879 BLAS_ARRAY (A), BLAS_INT_SCALAR (d), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

880 BLAS_ARRAY (b), BLAS_INT_SCALAR (one)) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

881 }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

882

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

883 /* solve XL'=B where:

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

884 * B is m-by-n with leading dimension ldb

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

885 * A is n-by-n with leading dimension lda */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

886 #define BLAS_TRSM_RIGHT(m,n,A,lda,B,ldb) \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

887 { \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

888 BLAS_DECLARE_SCALAR (alpha) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

889 BLAS_ASSIGN (alpha, 1.0, 0.0) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

890 (void) BLAS_TRSM_ROUTINE (BLAS_COLUMN_MAJOR_ORDER \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

891 BLAS_RIGHT, BLAS_LOWER, BLAS_TRANSPOSE, BLAS_UNIT_DIAGONAL, \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

892 BLAS_INT_SCALAR (m), BLAS_INT_SCALAR (n), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

893 BLAS_SCALAR (alpha), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

894 BLAS_ARRAY (A), BLAS_INT_SCALAR (lda), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

895 BLAS_ARRAY (B), BLAS_INT_SCALAR (ldb)) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

896 }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

897

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

898 /* x = s*x, where x is a stride-1 vector of length n */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

899 #define BLAS_SCAL(n,s,x) \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

900 { \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

901 Int one = 1 ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

902 BLAS_DECLARE_SCALAR (alpha) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

903 BLAS_ASSIGN (alpha, REAL_COMPONENT (s), IMAG_COMPONENT (s)) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

904 (void) BLAS_SCAL_ROUTINE ( \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

905 BLAS_INT_SCALAR (n), BLAS_SCALAR (alpha), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

906 BLAS_ARRAY (x), BLAS_INT_SCALAR (one)) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

907 }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

908

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

909 /* x = y, where x and y are a stride-1 vectors of length n */

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

910 #define BLAS_COPY(n,x,y) \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

911 { \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

912 Int one = 1 ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

913 (void) BLAS_COPY_ROUTINE ( \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

914 BLAS_INT_SCALAR (n), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

915 BLAS_ARRAY (x), BLAS_INT_SCALAR (one), \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

916 BLAS_ARRAY (y), BLAS_INT_SCALAR (one)) ; \

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

917 }

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

918

57077d0ddc8e [project @ 2005-02-25 19:55:24 by jwe]

jwe

parents:

diff changeset

919 #endif /* !defined (USE_NO_BLAS) } */

Mercurial > octave-nkf

annotate liboctave/UMFPACK/UMFPACK/Source/umf_config.h @ 5164:57077d0ddc8e