Differences between revisions 4 and 7 (spanning 3 versions)
Revision 4 as of 2023-01-13 23:31:51
Size: 2240
Comment:
Revision 7 as of 2023-06-07 19:49:48
Size: 1095
Comment:
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:

Stata offers several commands for computing higher-level statistics and translating datasets into aggregated formats.
Line 9: Line 11:
== Contract == == Contract and Collapse ==
Line 11: Line 13:
The '''`collapse`''' command is ideal for aggregated statistics. The [[Stata/Contract|contract]] and [[Stata/Collapse|collapse]] commands are ideal for aggregated statistics. The former is useful for descriptive statistics, while the latter is designed for summary statistics.
Line 14: Line 16:
contract VAR, freq(Count) percent(Percentage) contract foo, freq(Count) percent(Percentage)
Line 16: Line 18:

----



== Collapse ==
Line 29: Line 25:
The '''`reshape`''' command can be used to move data between wide and tall formats. It has additional, helpful features, such as a diagnostic `reshape errors` command. The [[Stata/Reshape|reshape]] command can be used to translate datasets between wide and tall formats.
Line 31: Line 27:


=== Reshape Wide ===

To expand long data into a wide format, try:
To translate into a wide format, try:
Line 43: Line 35:
If the group indicator should be placed anywhere other than as a suffix to `VARSTUB`, use a single at siogn (`@`) to indicate the placement. For example, if the groups are `1`, `2`, and `3`, then the command `reshape wide inc@r, i(KEYVAR) j(GROUPVAR)` would create variable `inc1r`, `inc2r`, and `inc3r`.

If `GROUPVAR` is a string variable, the `string` option is mandatory.

If data has been transformed through a `reshape wide` command like above, then to restore data to the long format, try:

{{{
reshape long
}}}

The parameters of the transformation are stored and reused between calls.



=== Reshape Long ===

To contract wide data into a long format, try:
To translate into a long format, try:
Line 65: Line 41:
...`VARSTUB` will be created from the variable list `VARSTUB*`, and `GROUPVAR` will be created to indicate the source of `VARSTUB`. If the `string` option is specified, `GROUPVAR` will be a string variable.

If the variable list does not follow the simple pattern of `VARSTUB*`, it may be possible to specify with an at sign (`@`). For example, if the target variables are `inc1r`, `inc2r`, and `inc3r` and the intended `GROUPVAR` groups are `1`, `2`, and `3`, then the command `reshape long inc@r, i(KEYVAR) j(GROUPVAR)` would correctly create the variable `incr`.

If data has been transformed through a `reshape long` command like above, then to restore data to the wide format, try:

{{{
reshape wide
}}}

The parameters of the transformation are stored and reused between calls.
The variable `VARSTUB` will be created from the variable list `VARSTUB*`, and `GROUPVAR` will be created to indicate the source of `VARSTUB`.

Aggregating Data with Stata

Stata offers several commands for computing higher-level statistics and translating datasets into aggregated formats.


Contract and Collapse

The contract and collapse commands are ideal for aggregated statistics. The former is useful for descriptive statistics, while the latter is designed for summary statistics.

contract foo, freq(Count) percent(Percentage)


Reshape

The reshape command can be used to translate datasets between wide and tall formats.

To translate into a wide format, try:

reshape wide VARSTUB, i(KEYVAR) j(GROUPVAR)

A series of variables named like VARSTUB* will be created, for each group of GROUPVAR.

To translate into a long format, try:

reshape long VARSTUB, i(KEYVAR) j(GROUPVAR)

The variable VARSTUB will be created from the variable list VARSTUB*, and GROUPVAR will be created to indicate the source of VARSTUB.


CategoryRicottone

Stata/AggregatingData (last edited 2023-06-09 16:51:50 by DominicRicottone)