Differences between revisions 1 and 11 (spanning 10 versions)
Revision 1 as of 2022-09-25 20:15:13
Size: 251
Comment:
Revision 11 as of 2025-10-24 16:26:08
Size: 1301
Comment: Rewrite
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
= Stata Aggregating Data = = Aggregating Data with Stata =

Stata offers several commands for computing aggregated statistics and translating datasets into aggregated formats.
Line 9: Line 11:
== Contract == == Statistics ==
Line 11: Line 13:
'''`collapse`''' is ideal for aggregated statistics. The [[Stata/Summarize|-summarize-]] command computes and [[Stata/StoredResults|stores]] descriptive statistics.

The [[Stata/Inspect|-inspect-]] command is useful for interactive exploration.

The [[Stata/Contract|-contract-]] and [[Stata/Collapse|-collapse-]] commands create datasets of aggregated statistics. The former is useful for descriptive statistics, while the latter is designed for summary statistics.
Line 14: Line 20:
contract VAR, freq(Count) percent(Percentage) contract foo, freq(Count) percent(Percentage)
Line 21: Line 27:
== Collapse == == Wide and Long Data ==

The [[Stata/Reshape|-reshape-]] command can be used to translate datasets between wide and long formats.

To translate into a wide format, try:

{{{
reshape wide VARSTUB, i(KEYVAR) j(GROUPVAR)
}}}

A series of variables named like `VARSTUB*` will be created, for each group of `GROUPVAR`.

To translate into a long format, try:

{{{
reshape long VARSTUB, i(KEYVAR) j(GROUPVAR)
}}}

The variable `VARSTUB` will be created from the variable list `VARSTUB*`, and `GROUPVAR` will be created to indicate the source of `VARSTUB`.


Aggregating Data with Stata

Stata offers several commands for computing aggregated statistics and translating datasets into aggregated formats.


Statistics

The -summarize- command computes and stores descriptive statistics.

The -inspect- command is useful for interactive exploration.

The -contract- and -collapse- commands create datasets of aggregated statistics. The former is useful for descriptive statistics, while the latter is designed for summary statistics.

contract foo, freq(Count) percent(Percentage)


Wide and Long Data

The -reshape- command can be used to translate datasets between wide and long formats.

To translate into a wide format, try:

reshape wide VARSTUB, i(KEYVAR) j(GROUPVAR)

A series of variables named like VARSTUB* will be created, for each group of GROUPVAR.

To translate into a long format, try:

reshape long VARSTUB, i(KEYVAR) j(GROUPVAR)

The variable VARSTUB will be created from the variable list VARSTUB*, and GROUPVAR will be created to indicate the source of VARSTUB.


CategoryRicottone

Stata/AggregatingData (last edited 2025-10-24 16:26:08 by DominicRicottone)