Revision 10 as of 2023-06-09 16:51:50
Aggregating Data with Stata
Stata offers several commands for computing aggregated statistics and translating datasets into aggregated formats.
Statistics
The summarize command computes descriptive statistics (count, mean, standard deviation, minimum, maximum) and stores them as r() results for use in later commands.
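As a minimal sketch of how stored results can be used, the following assumes the `auto` example dataset that ships with Stata; the local macro and the new variable name are made up for illustration.

```stata
* Load the example dataset that ships with Stata
sysuse auto, clear

* Compute descriptive statistics for one variable
summarize price

* Copy the stored mean into a local macro right away,
* because the next r-class command overwrites r()
local mean_price = r(mean)

* Show everything summarize left behind in r()
return list

* Use the saved value in a later command
generate price_centered = price - `mean_price'
```

Adding the `detail` option to `summarize` also stores percentiles, such as `r(p50)` for the median.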
The inspect command is useful for interactive exploration.
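The contract and collapse commands replace the dataset in memory with a dataset of aggregated statistics. contract is suited to frequency counts and percentages for each combination of values, while collapse is designed for summary statistics such as means, medians, and sums.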
contract foo, freq(Count) percent(Percentage)
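To make the difference between the two commands concrete, here is a short sketch, again assuming the `auto` example dataset; the output variable names (`Count`, `Percentage`, `mean_price`, `med_mpg`, `n`) are illustrative choices. Both commands overwrite the data in memory, so `preserve` and `restore` are used to get the original data back after the first one.

```stata
sysuse auto, clear

* contract: one observation per distinct value of rep78,
* with a frequency count and a percentage
preserve
contract rep78, freq(Count) percent(Percentage)
list
restore

* collapse: one observation per group of foreign,
* holding the requested summary statistics
collapse (mean) mean_price=price (median) med_mpg=mpg (count) n=price, by(foreign)
list
```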
Wide and Long Data
The reshape command can be used to translate datasets between wide and long formats.
To translate into a wide format, try:
reshape wide VARSTUB, i(KEYVAR) j(GROUPVAR)
One variable is created for each distinct value of GROUPVAR, named VARSTUB followed by that value, and the dataset is left with one observation per value of KEYVAR.
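A small worked sketch of the wide conversion, using a made-up dataset in which `id` plays the role of KEYVAR, `year` the role of GROUPVAR, and `income` the role of VARSTUB:

```stata
* Build a small long-format dataset: two rows per id
clear
input id year income
1 2020 100
1 2021 110
2 2020 200
2 2021 220
end

* Spread income across columns, one per value of year
reshape wide income, i(id) j(year)

* The data now hold one row per id, with the variables
* id, income2020, and income2021
list
```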
To translate into a long format, try:
reshape long VARSTUB, i(KEYVAR) j(GROUPVAR)
The variables matching VARSTUB* are stacked into a single variable named VARSTUB, and GROUPVAR is created to hold the suffix of the variable each value came from.
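The long conversion can be sketched the same way. The dataset below is made up for illustration: `id` stands in for KEYVAR, `income` for VARSTUB, and `year` is the GROUPVAR that `reshape` creates.

```stata
* Build a small wide-format dataset: one row per id
clear
input id income2020 income2021
1 100 110
2 200 220
end

* Stack income2020 and income2021 into a single income variable;
* year is created from the numeric suffixes
reshape long income, i(id) j(year)

* The data now hold one row per id and year
list
```

If the suffixes are not numeric (for example `incomeA` and `incomeB`), the `string` option is needed, as in `j(GROUPVAR) string`.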