1 @node Variable Attributes
2 @chapter Manipulating variables
4 The variables in the active dataset dictionary are important. There are
5 several utility functions for examining and adjusting them.
8 * ADD VALUE LABELS:: Add value labels to variables.
9 * DELETE VARIABLES:: Delete variables.
10 * DISPLAY:: Display information about the active dataset.
11 * FORMATS:: Set print and write formats.
12 * LEAVE:: Don't clear variables between cases.
13 * MISSING VALUES:: Set missing values for variables.
14 * MODIFY VARS:: Rename, reorder, and drop variables.
15 * MRSETS:: Add, modify, and list multiple response sets.
16 * NUMERIC:: Create new numeric variables.
17 * PRINT FORMATS:: Set variable print formats.
18 * RENAME VARIABLES:: Rename variables.
19 * VALUE LABELS:: Set value labels for variables.
20 * STRING:: Create new string variables.
21 * VARIABLE ATTRIBUTE:: Set custom attributes on variables.
22 * VARIABLE LABELS:: Set variable labels for variables.
23 * VARIABLE ALIGNMENT:: Set the alignment for display.
24 * VARIABLE WIDTH:: Set the display width.
25 * VARIABLE LEVEL:: Set the measurement level.
26 * VARIABLE ROLE:: Set the role that a variable fills in analysis.
27 * VECTOR:: Declare an array of variables.
28 * WRITE FORMATS:: Set variable write formats.
31 @node ADD VALUE LABELS
32 @section ADD VALUE LABELS
33 @vindex ADD VALUE LABELS
37 /@var{var_list} @var{value} '@var{label}' [@var{value} '@var{label}']@dots{}
40 @cmd{ADD VALUE LABELS} has the same syntax and purpose as @cmd{VALUE
41 LABELS} (@pxref{VALUE LABELS}), but it does not clear value
42 labels from the variables before adding the ones specified.
44 @node DELETE VARIABLES
45 @section DELETE VARIABLES
46 @vindex DELETE VARIABLES
49 DELETE VARIABLES @var{var_list}.
52 @cmd{DELETE VARIABLES} deletes the specified variables from the
53 dictionary. It may not be used to delete all variables from the
54 dictionary; use @cmd{NEW FILE} to do that (@pxref{NEW FILE}).
56 @cmd{DELETE VARIABLES} should not be used after defining transformations
57 but before executing a procedure. If it is used in such a context, it
58 causes the data to be read. If it is used while @cmd{TEMPORARY} is in
59 effect, it causes the temporary transformations to become permanent.
66 DISPLAY [SORTED] NAMES [[/VARIABLES=]@var{var_list}].
67 DISPLAY [SORTED] INDEX [[/VARIABLES=]@var{var_list}].
68 DISPLAY [SORTED] LABELS [[/VARIABLES=]@var{var_list}].
69 DISPLAY [SORTED] VARIABLES [[/VARIABLES=]@var{var_list}].
70 DISPLAY [SORTED] DICTIONARY [[/VARIABLES=]@var{var_list}].
71 DISPLAY [SORTED] SCRATCH [[/VARIABLES=]@var{var_list}].
72 DISPLAY [SORTED] ATTRIBUTES [[/VARIABLES=]@var{var_list}].
73 DISPLAY [SORTED] @@ATTRIBUTES [[/VARIABLES=]@var{var_list}].
74 DISPLAY [SORTED] VECTORS.
77 @cmd{DISPLAY} displays information about the active dataset. A variety
78 of different forms of information can be requested.
80 The following keywords primarily cause information about variables to
81 be displayed. With these keywords, by default information is
82 displayed about all variable in the active dataset, in the order that
83 variables occur in the active dataset dictionary. The @subcmd{SORTED} keyword
84 causes output to be sorted alphabetically by variable name. The
85 @subcmd{VARIABLES} subcommand limits output to the specified variables.
89 The variables' names are displayed.
92 The variables' names are displayed along with a value describing their
93 position within the active dataset dictionary.
96 Variable names, positions, and variable labels are displayed.
99 Variable names, positions, print and write formats, and missing values
103 Variable names, positions, print and write formats, missing values,
104 variable labels, and value labels are displayed.
107 Variable names are displayed, for scratch variables only (@pxref{Scratch
112 Datafile and variable attributes are displayed.
113 The first form of the command omits those attributes
114 whose names begin with @code{@@} or @code{$@@}.
115 In the second for, all datafile and variable attributes are displayed.
118 With the @code{VECTOR} keyword, @cmd{DISPLAY} lists all the currently
119 declared vectors. If the @subcmd{SORTED} keyword is given, the vectors are
120 listed in alphabetical order; otherwise, they are listed in textual
121 order of definition within the @pspp{} syntax file.
123 For related commands, see @ref{DISPLAY DOCUMENTS} and @ref{DISPLAY
131 FORMATS @var{var_list} (@var{fmt_spec}) [@var{var_list} (@var{fmt_spec})]@dots{}.
134 @cmd{FORMATS} set both print and write formats for the specified
135 variables to the specified format specification.
136 @xref{Input and Output Formats}.
138 Specify a list of variables followed by a format specification in
139 parentheses. The print and write formats of the specified variables
140 will be changed. All of the variables listed together must have
141 the same type and, for string variables, the same width.
143 Additional lists of variables and formats may be included following
146 @cmd{FORMATS} takes effect immediately. It is not affected by
147 conditional and looping structures such as @cmd{DO IF} or @cmd{LOOP}.
154 LEAVE @var{var_list}.
157 @cmd{LEAVE} prevents the specified variables from being
158 reinitialized whenever a new case is processed.
160 Normally, when a data file is processed, every variable in the active
161 dataset is initialized to the system-missing value or spaces at the
162 beginning of processing for each case. When a variable has been
163 specified on @cmd{LEAVE}, this is not the case. Instead, that variable is
164 initialized to 0 (not system-missing) or spaces for the first case.
165 After that, it retains its value between cases.
167 This becomes useful for counters. For instance, in the example below
168 the variable @code{SUM} maintains a running total of the values in the @code{ITEM}
173 COMPUTE SUM=SUM+ITEM.
184 @noindent Partial output from this example:
193 It is best to use @cmd{LEAVE} command immediately before invoking a
194 procedure command, because the left status of variables is reset by
195 certain transformations---for instance, @cmd{COMPUTE} and @cmd{IF}.
196 Left status is also reset by all procedure invocations.
199 @section MISSING VALUES
200 @vindex MISSING VALUES
203 MISSING VALUES @var{var_list} (@var{missing_values}).
205 where @var{missing_values} takes one of the following forms:
207 @var{num1}, @var{num2}
208 @var{num1}, @var{num2}, @var{num3}
209 @var{num1} THRU @var{num2}
210 @var{num1} THRU @var{num2}, @var{num3}
212 @var{string1}, @var{string2}
213 @var{string1}, @var{string2}, @var{string3}
214 As part of a range, @subcmd{LO} or @subcmd{LOWEST} may take the place of @var{num1};
215 @subcmd{HI} or @subcmd{HIGHEST} may take the place of @var{num2}.
218 @cmd{MISSING VALUES} sets user-missing values for numeric and string
219 variables. Long string variables may have missing values, but
220 characters after the first 8 bytes of the missing value must be
223 Specify a list of variables, followed by a list of their user-missing
224 values in parentheses. Up to three discrete values may be given, or,
225 for numeric variables only, a range of values optionally accompanied by
226 a single discrete value. Ranges may be open-ended on one end, indicated
227 through the use of the
228 keyword @subcmd{LO} or @subcmd{LOWEST} or @subcmd{HI} or @subcmd{HIGHEST}.
230 The @cmd{MISSING VALUES} command takes effect immediately. It is not
231 affected by conditional and looping constructs such as @cmd{DO IF} or
240 /REORDER=@{FORWARD,BACKWARD@} @{POSITIONAL,ALPHA@} (@var{var_list})@dots{}
241 /RENAME=(@var{old_names}=@var{new_names})@dots{}
242 /@{DROP,KEEP@}=@var{var_list}
246 @cmd{MODIFY VARS} reorders, renames, and deletes variables in the
249 At least one subcommand must be specified, and no subcommand may be
250 specified more than once. @subcmd{DROP} and @subcmd{KEEP} may not both
253 The @subcmd{REORDER} subcommand changes the order of variables in the active
254 dataset. Specify one or more lists of variable names in parentheses. By
255 default, each list of variables is rearranged into the specified order.
256 To put the variables into the reverse of the specified order, put
257 keyword @subcmd{BACKWARD} before the parentheses. To put them into alphabetical
258 order in the dictionary, specify keyword @subcmd{ALPHA} before the parentheses.
259 @subcmd{BACKWARD} and @subcmd{ALPHA} may also be combined.
261 To rename variables in the active dataset, specify @subcmd{RENAME}, an equals sign
262 (@samp{=}), and lists of the old variable names and new variable names
263 separated by another equals sign within parentheses. There must be the
264 same number of old and new variable names. Each old variable is renamed to
265 the corresponding new variable name. Multiple parenthesized groups of
266 variables may be specified.
268 The @subcmd{DROP} subcommand deletes a specified list of variables from the
271 The @subcmd{KEEP} subcommand keeps the specified list of variables in the active
272 dataset. Any unlisted variables are deleted from the active dataset.
274 @subcmd{MAP} is currently ignored.
276 If either @subcmd{DROP} or @subcmd{KEEP} is specified, the data is read;
279 @cmd{MODIFY VARS} may not be specified following @cmd{TEMPORARY}
288 /MDGROUP NAME=@var{name} VARIABLES=@var{var_list} VALUE=@var{value}
289 [CATEGORYLABELS=@{VARLABELS,COUNTEDVALUES@}]
290 [@{LABEL='@var{label}',LABELSOURCE=VARLABEL@}]
292 /MCGROUP NAME=@var{name} VARIABLES=@var{var_list} [LABEL='@var{label}']
294 /DELETE NAME=@{[@var{names}],ALL@}
296 /DISPLAY NAME=@{[@var{names}],ALL@}
299 @cmd{MRSETS} creates, modifies, deletes, and displays multiple
300 response sets. A multiple response set is a set of variables that
301 represent multiple responses to a single survey question in one of the
306 A @dfn{multiple dichotomy set} is analogous to a survey question with
307 a set of checkboxes. Each variable in the set is treated in a Boolean
308 fashion: one value (the "counted value") means that the box was
309 checked, and any other value means that it was not.
312 A @dfn{multiple category set} represents a survey question where the
313 respondent is instructed to list up to @var{n} choices. Each variable
314 represents one of the responses.
317 Any number of subcommands may be specified in any order.
319 The @subcmd{MDGROUP} subcommand creates a new multiple dichotomy set or
320 replaces an existing multiple response set. The @subcmd{NAME},
321 @subcmd{VARIABLES}, and
322 @subcmd{VALUE} specifications are required. The others are optional:
326 @var{NAME} specifies the name used in syntax for the new multiple dichotomy
327 set. The name must begin with @samp{$}; it must otherwise follow the
328 rules for identifiers (@pxref{Tokens}).
331 @subcmd{VARIABLES} specifies the variables that belong to the set. At least
332 two variables must be specified. The variables must be all string or
336 @subcmd{VALUE} specifies the counted value. If the variables are numeric, the
337 value must be an integer. If the variables are strings, then the
338 value must be a string that is no longer than the shortest of the
339 variables in the set (ignoring trailing spaces).
342 @subcmd{CATEGORYLABELS} optionally specifies the source of the labels for each
347 @subcmd{VARLABELS}, the default, uses variable labels or, for variables without
348 variable labels, variable names. @pspp{} warns if two variables have the
349 same variable label, since these categories cannot be distinguished in
353 @subcmd{COUNTEDVALUES} instead uses each variable's value label for the counted
354 value. @pspp{} warns if two variables have the same value label for the
355 counted value or if one of the variables lacks a value label, since
356 such categories cannot be distinguished in output.
360 @subcmd{LABEL} optionally specifies a label for the multiple response set. If
361 neither @subcmd{LABEL} nor @subcmd{LABELSOURCE=VARLABEL} is specified, the set is
365 @subcmd{LABELSOURCE=VARLABEL} draws the multiple response set's label from the
366 first variable label among the variables in the set; if none of the
367 variables has a label, the name of the first variable is used.
368 @subcmd{LABELSOURCE=VARLABEL} must be used with @subcmd{CATEGORYLABELS=COUNTEDVALUES}.
369 It is mutually exclusive with @subcmd{LABEL}.
372 The @subcmd{MCGROUP} subcommand creates a new multiple category set or
373 replaces an existing multiple response set. The @subcmd{NAME} and @subcmd{VARIABLES}
374 specifications are required, and @subcmd{LABEL} is optional. Their meanings
375 are as described above in @subcmd{MDGROUP}. @pspp{} warns if two variables in the
376 set have different value labels for a single value, since each of the
377 variables in the set should have the same possible categories.
379 The @subcmd{DELETE} subcommand deletes multiple response groups. A list of
380 groups may be named within a set of required square brackets, or ALL
381 may be used to delete all groups.
383 The @subcmd{DISPLAY} subcommand displays information about defined multiple
384 response sets. Its syntax is the same as the @subcmd{DELETE} subcommand.
386 Multiple response sets are saved to and read from system files by,
387 e.g., the @cmd{SAVE} and @cmd{GET} command. Otherwise, multiple
388 response sets are currently used only by third party software.
395 NUMERIC /@var{var_list} [(@var{fmt_spec})].
398 @cmd{NUMERIC} explicitly declares new numeric variables, optionally
399 setting their output formats.
401 Specify a slash (@samp{/}), followed by the names of the new numeric
402 variables. If you wish to set their output formats, follow their names
403 by an output format specification in parentheses (@pxref{Input and Output
404 Formats}); otherwise, the default is F8.2.
406 Variables created with @cmd{NUMERIC} are initialized to the
407 system-missing value.
410 @section PRINT FORMATS
411 @vindex PRINT FORMATS
414 PRINT FORMATS @var{var_list} (@var{fmt_spec}) [@var{var_list} (@var{fmt_spec})]@dots{}.
417 @cmd{PRINT FORMATS} sets the print formats for the specified
418 variables to the specified format specification.
420 Its syntax is identical to that of @cmd{FORMATS} (@pxref{FORMATS}),
421 but @cmd{PRINT FORMATS} sets only print formats, not write formats.
423 @node RENAME VARIABLES
424 @section RENAME VARIABLES
425 @vindex RENAME VARIABLES
428 RENAME VARIABLES (@var{old_names}=@var{new_names})@dots{} .
431 @cmd{RENAME VARIABLES} changes the names of variables in the active
432 dataset. Specify lists of the old variable names and new
433 variable names, separated by an equals sign (@samp{=}), within
434 parentheses. There must be the same number of old and new variable
435 names. Each old variable is renamed to the corresponding new variable
436 name. Multiple parenthesized groups of variables may be specified.
437 When the old and new variable names contain only a single variable name,
438 the parentheses are optional.
440 @cmd{RENAME VARIABLES} takes effect immediately. It does not cause the data
443 @cmd{RENAME VARIABLES} may not be specified following @cmd{TEMPORARY}
447 @section VALUE LABELS
452 /@var{var_list} @var{value} '@var{label}' [@var{value} '@var{label}']@dots{}
455 @cmd{VALUE LABELS} allows values of numeric and short string
456 variables to be associated with labels. In this way, a short value can
457 stand for a long value.
459 To set up value labels for a set of variables, specify the
460 variable names after a slash (@samp{/}), followed by a list of values
461 and their associated labels, separated by spaces.
463 Value labels in output are normally broken into lines automatically.
464 Put @samp{\n} in a label string to force a line break at that point.
465 The label may still be broken into lines at additional points.
467 Before @cmd{VALUE LABELS} is executed, any existing value labels
468 are cleared from the variables specified. Use @cmd{ADD VALUE LABELS}
469 (@pxref{ADD VALUE LABELS}) to add value labels without clearing those
477 STRING @var{var_list} (@var{fmt_spec}) [/@var{var_list} (@var{fmt_spec})] [@dots{}].
480 @cmd{STRING} creates new string variables for use in
483 Specify a list of names for the variable you want to create,
484 followed by the desired output format specification in
485 parentheses (@pxref{Input and Output Formats}).
487 implicitly derived from the specified output formats.
488 The created variables will be initialized to spaces.
490 If you want to create several variables with distinct
491 output formats, you can either use two or more separate @cmd{STRING} commands,
492 or you can specify further variable list and format specification pairs, each separated
493 from the previous by a slash (@samp{/}).
495 The following example is one way to create three string variables; Two of the
496 variables have format A24 and the other A80:
498 STRING firstname lastname (A24) / address (A80).
501 @noindent Here is another way to achieve the same result:
503 STRING firstname lastname (A24).
504 STRING address (A80).
507 @noindent @dots{} and here is yet another way:
510 STRING firstname (A24).
511 STRING lastname (A24).
512 STRING address (A80).
518 @node VARIABLE ATTRIBUTE
519 @section VARIABLE ATTRIBUTE
520 @vindex VARIABLE ATTRIBUTE
524 VARIABLES=@var{var_list}
525 ATTRIBUTE=@var{name}('@var{value}') [@var{name}('@var{value}')]@dots{}
526 ATTRIBUTE=@var{name}@b{[}@var{index}@b{]}('@var{value}') [@var{name}@b{[}@var{index}@b{]}('@var{value}')]@dots{}
527 DELETE=@var{name} [@var{name}]@dots{}
528 DELETE=@var{name}@b{[}@var{index}@b{]} [@var{name}@b{[}@var{index}@b{]}]@dots{}
531 @cmd{VARIABLE ATTRIBUTE} adds, modifies, or removes user-defined
532 attributes associated with variables in the active dataset. Custom
533 variable attributes are not interpreted by @pspp{}, but they are saved as
534 part of system files and may be used by other software that reads
537 The required @subcmd{VARIABLES} subcommand must come first. Specify the
538 variables to which the following @subcmd{ATTRIBUTE} or @subcmd{DELETE} subcommand
541 Use the @subcmd{ATTRIBUTE} subcommand to add or modify custom variable
542 attributes. Specify the name of the attribute as an identifier
543 (@pxref{Tokens}), followed by the desired value, in parentheses, as a
544 quoted string. The specified attributes are then added or modified in
545 the variables specified on @subcmd{VARIABLES}. Attribute names that begin with
546 @code{$} are reserved for @pspp{}'s internal use, and attribute names
547 that begin with @code{@@} or @code{$@@} are not displayed by most @pspp{}
548 commands that display other attributes. Other attribute names are not
551 Attributes may also be organized into arrays. To assign to an array
552 element, add an integer array index enclosed in square brackets
553 (@code{[} and @code{]}) between the attribute name and value. Array
554 indexes start at 1, not 0. An attribute array that has a single
555 element (number 1) is not distinguished from a non-array attribute.
557 Use the @subcmd{DELETE} subcommand to delete an attribute from the variable
558 specified on @subcmd{VARIABLES}. Specify an attribute name by itself to delete
559 an entire attribute, including all array elements for attribute
560 arrays. Specify an attribute name followed by an array index in
561 square brackets to delete a single element of an attribute array. In
562 the latter case, all the array elements numbered higher than the
563 deleted element are shifted down, filling the vacated position.
565 To associate custom attributes with the entire active dataset, instead of
566 with particular variables, use @cmd{DATAFILE ATTRIBUTE} (@pxref{DATAFILE ATTRIBUTE}) instead.
568 @cmd{VARIABLE ATTRIBUTE} takes effect immediately. It is not affected
569 by conditional and looping structures such as @cmd{DO IF} or
572 @node VARIABLE LABELS
573 @section VARIABLE LABELS
574 @vindex VARIABLE LABELS
578 @var{var_list} '@var{var_label}'
579 [ /@var{var_list} '@var{var_label}']
583 [ /@var{var_list} '@var{var_label}']
586 @cmd{VARIABLE LABELS} associates explanatory names
587 with variables. This name, called a @dfn{variable label}, is displayed by
588 statistical procedures.
590 To assign a variable label to a group of variables, specify a
591 list of variable names and the variable label as a string.
592 To assign different labels to different variables in the same command,
593 precede the subsequent variable list with a slash (@samp{/}).
596 @node VARIABLE ALIGNMENT
597 @section VARIABLE ALIGNMENT
598 @vindex VARIABLE ALIGNMENT
602 @var{var_list} ( LEFT | RIGHT | CENTER )
603 [ /@var{var_list} ( LEFT | RIGHT | CENTER ) ]
607 [ /@var{var_list} ( LEFT | RIGHT | CENTER ) ]
610 @cmd{VARIABLE ALIGNMENT} sets the alignment of variables for display editing
611 purposes. This only has effect for third party software. It does not affect
612 the display of variables in the @pspp{} output.
618 @section VARIABLE WIDTH
619 @vindex VARIABLE WIDTH
622 @var{var_list} (width)
623 [ /@var{var_list} (width) ]
627 [ /@var{var_list} (width) ]
630 @cmd{VARIABLE WIDTH} sets the column width of variables for display editing
631 purposes. This only affects third party software. It does not affect
632 the display of variables in the @pspp{} output.
636 @section VARIABLE LEVEL
637 @vindex VARIABLE LEVEL
640 @var{var_list} ( SCALE | NOMINAL | ORDINAL )
641 [ /@var{var_list} ( SCALE | NOMINAL | ORDINAL ) ]
645 [ /@var{var_list} ( SCALE | NOMINAL | ORDINAL ) ]
648 @cmd{VARIABLE LEVEL} sets the measurement level of variables.
649 Currently, this has no effect except for certain third party software.
653 @section VARIABLE ROLE
654 @vindex VARIABLE ROLE
657 /@var{role} @var{var_list}
658 [/@var{role} @var{var_list}]@dots{}
661 @cmd{VARIABLE ROLE} sets the intended role of a variable for use in
662 dialog boxes in graphical user interfaces. Each @var{role} specifies
663 one of the following roles for the variables that follow it:
667 An input variable, such as an independent variable.
670 An output variable, such as an dependent variable.
673 A variable used for input and output.
676 No role assigned. (This is a variable's default role.)
679 Used to break the data into groups for testing.
682 No meaning except for certain third party software. (This role's
683 meaning is unrelated to @cmd{SPLIT FILE}.)
686 The PSPPIRE GUI does not yet use variable roles as intended.
693 Two possible syntaxes:
694 VECTOR @var{vec_name}=@var{var_list}.
695 VECTOR @var{vec_name_list}(@var{count} [@var{format}]).
698 @cmd{VECTOR} allows a group of variables to be accessed as if they
699 were consecutive members of an array with a vector(index) notation.
701 To make a vector out of a set of existing variables, specify a name
702 for the vector followed by an equals sign (@samp{=}) and the variables
703 to put in the vector. All the variables in the vector must be the same
704 type. String variables in a vector must all have the same width.
706 To make a vector and create variables at the same time, specify one or
707 more vector names followed by a count in parentheses. This will cause
708 variables named @code{@var{vec}1} through @code{@var{vec}@var{count}}
709 to be created as numeric variables. By default, the new variables
710 have print and write format F8.2, but an alternate format may be
711 specified inside the parentheses before or after the count and
712 separated from it by white space or a comma. Variable names including
713 numeric suffixes may not exceed 64 characters in length, and none of
714 the variables may exist prior to @cmd{VECTOR}.
716 Vectors created with @cmd{VECTOR} disappear after any procedure or
717 procedure-like command is executed. The variables contained in the
718 vectors remain, unless they are scratch variables (@pxref{Scratch
721 Variables within a vector may be referenced in expressions using
722 @code{vector(index)} syntax.
725 @section WRITE FORMATS
726 @vindex WRITE FORMATS
729 WRITE FORMATS @var{var_list} (@var{fmt_spec}) [@var{var_list} (@var{fmt_spec})]@dots{}.
732 @cmd{WRITE FORMATS} sets the write formats for the specified variables
733 to the specified format specification. Its syntax is identical to
734 that of @cmd{FORMATS} (@pxref{FORMATS}), but @cmd{WRITE FORMATS} sets only
735 write formats, not print formats.