Size: 6538
Comment:
|
Size: 8283
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 1: | Line 1: |
## page was renamed from SPSS/AggregateFunctions | |
Line 15: | Line 16: |
Generally, missing values are ignored. String values will never be considered missing. Generally, a missing value are only returned if ''all'' values are missing. The exception is `SD`, which requires two non-missing values. PSPP extends this syntax with handling for user missing values. To include this type of missing values on any aggregate function, append the function name with a period. For example, `SUM.`. |
|
Line 21: | Line 28: |
Only the `MAX`, `MIN`, `FIRST`, and `LAST` functions copy the metadata of a source variable. All other created variables lack metadata by default. | Only the `MAX`, `MIN`, `FIRST`, and `LAST` functions copy the metadata of a source variable. All other created variables lack labels and have a format pre-determined by the function. * `F5.3` for functions `FGT`, `FIN`, `FLT`, and `FOUT` * `F5.1` for functions `PGT`, `PIN`, `PLT`, and `POUT` * `F7.0` for functions `NU` and `NUMISS` * `F8.2` for functions `CGT`, `CIN`, `CLT`, `COUT`, `MEAN`, `MEDIAN`, `SD`, `SUM`, `N`, and `NMISS` * If weighting is not enabled, `N` and `NMISS` create variables with a format of `F7.0`. In other words, `N` and `NMISS` match the behavior of `NU` and `NUMISS` when weighting is disabled. |
Line 37: | Line 52: |
Note: unsupported in SPSS version 21 or earlier. |
|
Line 49: | Line 66: |
Note: unsupported in SPSS version 21 or earlier. |
|
Line 57: | Line 76: |
Note: unsupported in SPSS version 21 or earlier. |
|
Line 71: | Line 92: |
Note: unsupported in SPSS version 21 or earlier. |
|
Line 97: | Line 120: |
The '''`FIRST`''' function returns the first non-missing value in a break group. TODO: what happens if specified with a variable list argument? |
The '''`FIRST`''' function returns the first non-missing value for each source variable. String values will never be considered missing. |
Line 129: | Line 150: |
The '''`LAST`''' function returns the last non-missing value in a break group. TODO: what happens if specified with a variable list argument? |
The '''`LAST`''' function returns the lastnon-missing value for each source variable. String values will never be considered missing. ---- == Max == The '''`MAX`''' function returns a maximum non-missing value for each source variable. String values are evaluated according to codepoints. For example, `"Z"` has a higher codepoint than `"A"`, so between the two values the maximum value is `"Z"`. String values will never be considered missing. |
Line 147: | Line 176: |
== Max == The '''`MAX`''' function returns a maximum value for each source variable. ---- |
|
Line 159: | Line 180: |
Note: only valid for numeric variables. |
|
Line 165: | Line 188: |
The '''`MIN`''' function returns a minimum value for each source variable. | The '''`MIN`''' function returns a minimum non-missing value for each source variable. String values are evaluated according to codepoints. For example, `"A"` has a lower codepoint than `"Z"`, so between the two values the minimum value is `"A"`. String values will never be considered missing. |
Line 175: | Line 200: |
If specified with a variable list argument, the `N` function returns a weighted number of cases with non-missing values for each source variable. | If specified with a variable list argument, the `N` function returns a weighted number of cases with non-missing values for each source variable. String values will never be considered missing. |
Line 183: | Line 208: |
The '''`NMISS`''' function returns a weighted number of missing cases in a break group. TODO: what happens if specified with a variable list argument? |
The '''`NMISS`''' function returns a weighted number of cases with missing values for each source variable. String values will never be considered missing. |
Line 195: | Line 218: |
If specified with a variable list argument, the `NU` function returns an unweighted number of cases with non-missing values for each source variable. | If specified with a variable list argument, the `NU` function returns an unweighted number of cases with non-missing values for each source variable. String values will never be considered missing. |
Line 203: | Line 226: |
The '''`NUMISS`''' function returns an unweighted number of missing cases in a break group. TODO: what happens if specified with a variable list argument? |
The '''`NUMISS`''' function returns an unweighted number of cases with missing values for each source variable. String values will never be considered missing. |
SPSS Aggregate Functions
These are the functions that can be used on the AGGREGATE command.
Contents
General Syntax
The number of target variables must match the number of source variables.
Generally, missing values are ignored. String values will never be considered missing.
Generally, a missing value are only returned if all values are missing. The exception is SD, which requires two non-missing values.
PSPP extends this syntax with handling for user missing values. To include this type of missing values on any aggregate function, append the function name with a period. For example, SUM..
Variable Metadata
Only the MAX, MIN, FIRST, and LAST functions copy the metadata of a source variable.
All other created variables lack labels and have a format pre-determined by the function.
F5.3 for functions FGT, FIN, FLT, and FOUT
F5.1 for functions PGT, PIN, PLT, and POUT
F7.0 for functions NU and NUMISS
F8.2 for functions CGT, CIN, CLT, COUT, MEAN, MEDIAN, SD, SUM, N, and NMISS
If weighting is not enabled, N and NMISS create variables with a format of F7.0. In other words, N and NMISS match the behavior of NU and NUMISS when weighting is disabled.
To specify a variable label for a new target variable, list the label in quotes following the new variable name.
Value labels cannot be specified.
Variable formats cannot be specified.
Cgt
The CGT function returns a count of cases with a value greater than a specified second argument for each source variable.
Note: unsupported in SPSS version 21 or earlier.
Cin
The CIN function returns a count of cases with a value within some range for each source variable.
The range is specified by the second and third arguments, and it is inclusive of those two values as well. If the second argument is greater than the third, they are automatically reversed.
If the second and third arguments are equal, CIN returns a count of cases with a value equal to the second argument.
Note: unsupported in SPSS version 21 or earlier.
Clt
The CLT function returns a count of cases with a value lesser than a specified second argument for each source variable.
Note: unsupported in SPSS version 21 or earlier.
Cout
The COUT function returns a count of cases with a value outside of some range for each source variable.
The range is specified by the second and third arguments, and it is inclusive of those two values as well. If the second argument is greater than the third, they are automatically reversed.
If the second and third arguments are equal, COUT returns a count of cases with a value not equal to the second argument.
COUT is complementary of CIN.
Note: unsupported in SPSS version 21 or earlier.
Fgt
The FGT function returns a fraction of cases with a value greater than a specified second argument for each source variable.
Fin
The FIN function returns a fraction of cases with a value within some range for each source variable.
The range is specified by the second and third arguments, and it is inclusive of those two values as well. If the second argument is greater than the third, they are automatically reversed.
If the second and third arguments are equal, FIN returns a fraction of cases with a value equal to the second argument.
First
The FIRST function returns the first non-missing value for each source variable. String values will never be considered missing.
Flt
The FLT function returns a fraction of cases with a value lesser than a specified second argument for each source variable.
Fout
The FOUT function returns a percentage of cases with a value outside of some range for each source variable.
The range is specified by the second and third arguments, and it is inclusive of those two values as well. If the second argument is greater than the third, they are automatically reversed.
If the second and third arguments are equal, FOUT returns a fraction of cases with a value not equal to the second argument.
FOUT is complementary of FIN.
Last
The LAST function returns the lastnon-missing value for each source variable. String values will never be considered missing.
Max
The MAX function returns a maximum non-missing value for each source variable.
String values are evaluated according to codepoints. For example, "Z" has a higher codepoint than "A", so between the two values the maximum value is "Z". String values will never be considered missing.
Mean
The MEAN function returns a mean across cases for each source variable.
Note: only valid for numeric variables.
Median
The MEDIAN function returns a median value for each source variable.
Note: only valid for numeric variables.
Min
The MIN function returns a minimum non-missing value for each source variable.
String values are evaluated according to codepoints. For example, "A" has a lower codepoint than "Z", so between the two values the minimum value is "A". String values will never be considered missing.
N
The N function returns a weighted number of cases in a break group.
If specified with a variable list argument, the N function returns a weighted number of cases with non-missing values for each source variable. String values will never be considered missing.
Nmiss
The NMISS function returns a weighted number of cases with missing values for each source variable. String values will never be considered missing.
Nu
The NU function returns an unweighted number of cases in a break group.
If specified with a variable list argument, the NU function returns an unweighted number of cases with non-missing values for each source variable. String values will never be considered missing.
Numiss
The NUMISS function returns an unweighted number of cases with missing values for each source variable. String values will never be considered missing.
Pgt
The PGT function returns a percentage of cases with a value greater than a specified second argument for each source variable.
Pin
The PIN function returns a percentage of cases with a value within some range for each source variable.
The range is specified by the second and third arguments, and it is inclusive of those two values as well. If the second argument is greater than the third, they are automatically reversed.
If the second and third arguments are equal, PIN returns a percentage of cases with a value equal to the second argument.
Plt
The PLT function returns a percentage of cases with a value lesser than a specified second argument for each source variable.
Pout
The POUT function returns a percentage of cases with a value outside of some range for each source variable.
The range is specified by the second and third arguments, and it is inclusive of those two values as well. If the second argument is greater than the third, they are automatically reversed.
If the second and third arguments are equal, POUT returns a percentage of cases with a value not equal to the second argument.
POUT is complementary of PIN.
SD
The SD function returns a standard deviation across cases for each source variable.
Note: only valid for numeric variables.
Sum
The SUM function returns a sum across cases for each source variable.
Note: only valid for numeric variables.