UK DATA ARCHIVE: IMPORTANT STUDY INFORMATION

Study Number 4642 - Scottish Household Survey, 2001-2002


NEW EDITION INFORMATION

For the fourth edition, the depositor supplied five Scottish Index of Multiple Deprivation (SIMD) variables, which have been added to the main data file. The variables relate to the Scottish Executive's urban rural classification and the Scottish Index of Multiple Deprivation (SIMD) and rural/urban classification. Versions of the urban/rural variables were already on each data file using the settlement definitions that were current when the files were created. The new variables are a version of the classification using the 2003 settlement definitions, giving a consistent basis for analysis of change over time. These frozen variables (RURFROZ6 and RURFROZ8) will also be added to future data files. The SIMD variables (MD05PC15, MD05DEC and MD05QUIN) are named MD05 even though they relate to the 2004 index. They reflect revisions made in 2005 and since the original version on the data were named MD04, these new variables have been given a different name. Users can find more information at the Scottish Executive Urban Rural Classification and Scottish Index of Multiple Deprivation 2004 web pages.

DATA PROCESSING NOTES


Data Archive Processing Standards

The data were processed to the UK Data Archive's A* standard. This is the Archive's highest standard, and means that an extremely rigorous and comprehensive series of checks was carried out to ensure the quality of the data and documentation.Firstly, checks were made that the number of cases and variables matched the depositor's records. Secondly, checks were made that all variables had comprehensible variable labels and all nominal (categorical) variables had comprehensible value labels. Where possible, either with reference to the documentation and/or in communication with the depositor, labels were accordingly edited or created. Thirdly, logical checks were performed to ensure that nominal (categorical) variables had values within the range defined (either by value labels or in the depositor's documentation). Lastly, any data or documentation that breached confidentiality rules were altered or suppressed to preserve anonymity.

All notable and/or outstanding problems discovered are detailed under the 'Data and documentation problems' heading below.

Data and documentation problems

Travel Diary Data:

a. Updated 2001 Travel Diary data (added Jan 2004):
Files dajn01 and dast01:
i. ENDHR (End Time (hr) of journey) - these variables includes isolated out-of-range values of '25'.
ii. OCODE/DCODE (Type of origin/destination postcode): these variables contain significant numbers of missing values.

b. 2002 Travel Diary Data (added Jan 2004):
Files dajn02 and dast02:
i. ENDHR (End Time (hr) of journey) - these variables includes isolated out-of-range values of '25' and '26'.
ii. OCODE/DCODE (Type of origin/destination postcode): these variables contain significant numbers of missing values.

Main data file:
Variables RCLASS and HCLASS both contain one out-of-range value of '8532'.

The documentation states that the data file contains 2075 variables and 30,638 cases. It actually contains 2081 variables and 30,639 cases.

Useful Notes

Amended variable labels for RE10a to RE10v were supplied by the depositor in 2004. These variables should therefore be labelled as follows:
re10a "RE10 Blind / poor eyesight / eye problems"
re10b "RE10 Can't afford driving lessons"
re10c "RE10 Can't afford a car"
re10d "RE10 Can't afford to run a car"
re10e "RE10 Disability"
re10f "RE10 Don't have a car"
re10g "RE10 Don't have a licence"
re10h "RE10 Failed test"
re10i "RE10 Health problems"
re10j "RE10 Never learned to drive"
re10k "RE10 Never wanted to / not interested"
re10l "RE10 No road sense"
re10n "RE10 Prefer to walk"
re10o "RE10 Too much traffic"
re10p "RE10 Too nervous / lack confidence"
re10q "RE10 Too old"
re10r "RE10 Too young"
re10s "RE10 Other"
re10t "RE10 Don't need to drive"
re10u "RE10 Never got round to it / no time to learn"
re10v "RE10 Banned / lost licence"

Download Service users - Stata:
As the main data file for this study contains more than 2047 variables, the Stata version of this file is only available in Stata8 SE format. The travel diary files are much smaller, and readable in Stata6 or Stata7. They have however been included in the Stata8_SE zip file along with the main file, for ease of download.

For further information on the Scottish Household Survey, users should consult the Scottish Household Survey web site.

Data conversion information

From January 2003 onwards, almost all data conversions have been performed using software developed by the UKDA. This enables standardisation of the conversion methods and ensures optimal data quality. In addition to its own data processing/conversion code, this software uses the SPSS and Stat/Transfer command processors to perform certain format translations. Although data conversion is automated, all data files are also subject to visual inspection by a UKDA data processing officer.

With some format conversions data, and more especially internal metadata (i.e. variable labels, value labels, missing value definitions, data type information), will inevitably be lost or truncated owing to the differential limits of the proprietary formats.A UKDA Data Dictionary file (in rich text format), corresponding to each data file, is usually provided for viewing and searching the internal metadata as it existed in the originating format. These files are called: [data file name]_UKDA_Data_Dictionary.rtf

Important information about the data format supplied

The links below provide important information about the format in which you have been supplied the data. Some of this information is specific to the ingest format of the data, that is the format in which the UKDA was supplied the data in. The ingest format for this study was SPSS

Please click below to find out information about the format that you have been supplied the data in.

SPSS (*.por)

STATA (*.dta)
Tab-delimited text (*.tab)
MS Excel (*.xls files)
SAS (supplied as *.dat and *.sas)
MS Access (*.mdb files)

Conversion of documentation formats

Electronic and paper documentation supplied with this study is usually incorporated into the UKDA User Guide (in PDF format). The conversion programmes used are the latest versions of Adobe PDF Writer for electronic documentation and Adobe Paper Capture (Acrobat 'plugin' version) for paper documentation. Occasionally, someof the electronic documentation cannot be usefully converted to PDF (e.g. MS Excel files with wide worksheets) and this is supplied ina more appropriate format. All User Guides are fully bookmarked.