Data for assignment 3: data description

Background: study description

These data are from a hypothetical study of birthweight and the associated characteristics of the baby and the mother. The data set consists of 200 records for the data cleaning set and 100 records for the analysis set, each containing information on the baby’s sex and birthweight and the mother’s smoking status, body mass index (bmi) and age. The data have been selected in such a way that we can assume they are independent.

For this study, smoking status is taken to mean whether or not the mother smoked during the first trimester of pregnancy. A low birthweight is defined as a weight less than or equal to 2,500g.

Body mass index is a measure for human body shape based on an individual's weight and height. For this study it is defined as the individual's body mass in kilograms divided by the square of their height in metres. So, its formula is

\[ \mathrm{BMI} = \frac{\text{mass (kg)}}{\left(\text{height (m)}\right)^{2}} \]

The mothers’ BMI was assessed at the start of the pregnancy.

These are synthetic data, but they can be referenced as coming from: assignment 3 data: birthweights

Variables included in the data set

Variable name | Description | Units / coding | Range
ID | Unique identification number for each participant | |
maternal age | Age of the mother at the time of the birth | Years | 18–42*
bmi | Body mass index of the mother at the start of the pregnancy | kg/m² | 19.4–42.5*
smoking status | Whether or not the mother smoked during the first trimester of pregnancy | 1 = mother smoked during the first trimester of pregnancy; 0 = mother did not smoke during the first trimester of pregnancy |
sex | Sex of baby | 1 = male; 0 = female |
low birthweight | Whether or not the baby has a low birthweight (<= 2,500g) | 1 = low birthweight; 0 = not low birthweight |
birth weight | Baby’s birthweight | grams | 2139.2–3089.6*

* This represents the maximum possible range for the data. This does not mean that each data set contains the whole allowable range.
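The two definitions above (BMI and the low birthweight cut-off) can be written as a short Python sketch. The function names and the example mother (70 kg, 1.65 m) are invented for illustration and do not come from the data set.

```python
# Threshold taken from the description: low birthweight is <= 2,500 g.
LOW_BIRTHWEIGHT_THRESHOLD_G = 2500.0


def bmi(mass_kg: float, height_m: float) -> float:
    """Body mass index: mass in kilograms divided by height in metres squared."""
    if height_m <= 0:
        raise ValueError("height must be positive")
    return mass_kg / height_m ** 2


def is_low_birthweight(birthweight_g: float) -> int:
    """Return 1 for low birthweight (<= 2,500 g) and 0 otherwise, as coded in the data."""
    return 1 if birthweight_g <= LOW_BIRTHWEIGHT_THRESHOLD_G else 0


# Hypothetical mother (values invented for illustration only).
print(f"BMI: {bmi(70.0, 1.65):.1f} kg/m^2")

# Two birthweights in grams; the first is below the threshold, the second above.
print(is_low_birthweight(2344.2), is_low_birthweight(2742.6))
```

Note that the comparison is inclusive, so a baby weighing exactly 2,500 g counts as low birthweight.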
qattachments_866e1fa2a34a8ce2a87bd76e5b791016b904f883

ID,maternal age,bmi,smoking status,sex,low birthweight,birth weight
1,32,30.5,0,1,0,2742.6
2,29,26.5,0,1,0,2577.4
3,26,24.5,0,0,0,2574.4
4,33,29.5,0,1,0,2752
5,27,24,1,1,1,2344.2
6,27,22,0,1,0,2550.9
7,23,32.6,1,0,1,2353.6
8,31,26.9,0,1,0,2696.5
9,29,27.7,1,1,1,2460.4
10,25,35.6,1,1,0,2514.6
11,31,32.8,0,0,0,2868.5
12,32,25.3,1,1,0,2549.5
13,25,29.3,0,1,0,2574.6
14,29,22.5,0,1,0,2512
15,28,28.6,1,1,0,2529.9
16,37,27.1,0,0,0,2780.3
17,30,27.9,1,0,1,2422.7
18,24,29.9,0,1,0,2636.2
19,32,26,0,0,0,2641
20,30,29.1,0,1,0,2702.5
21,36,28.2,1,1,0,2625.9
22,31,24.6,0,0,0,2689.9
23,33,27.6,1,1,1,2447.3
24,31,22.3,0,1,0,2565.1
25,30,23.7,0,1,0,2606.3
26,32,21.5,0,1,0,2614.8
27,33,31.1,1,0,0,2646.5
28,38,32,0,1,0,2907.8
29,32,36.7,1,1,0,2656.9
30,32,28,0,1,0,2782.5
31,39,33,0,0,0,2926.3
32,35,30.7,0,0,0,2899.3
33,28,22.3,0,1,1,2466.3
34,31,31.8,1,0,0,2604.8
35,37,19.9,0,0,0,2685.7
36,35,28.7,0,1,0,2796.3
37,34,27.1,0,1,0,2829.1
38,28,30.9,1,1,1,2496.9
39,24,33.7,1,1,1,2410.8
40,24,27.6,1,0,1,2279.3
41,32,27.9,0,1,0,2791.2
42,24,27.9,1,0,1,2426.9
43,36,21.7,1,0,1,2441.3
44,34,30.9,1,1,0,2574.7
45,35,27.7,0,1,0,2777.1
46,29,32.2,0,1,0,2739.2
47,33,35.8,0,1,0,2851.6
48,29,25.7,0,1,0,2634.6
49,30,39.3,0,0,0,2927.6
50,28,27.6,0,0,0,2654
51,35,35,0,1,0,2895.2
52,34,24,0,1,0,2706.7
53,32,31.4,0,1,0,2804
54,26,32.7,0,0,0,2720.1
55,26,34.6,1,1,0,2599
56,20,30.4,0,0,0,2564.5
57,30,29.2,1,1,1,2486.1
58,30,32.6,0,1,0,2800.9
59,35,36.6,0,1,0,2857.5
60,30,29.8,0,1,0,2698.3
61,37,33.1,0,1,0,2925.5
62,34,27,1,1,1,2484.8
63,30,26.5,1,1,1,2425
64,28,26.9,1,0,1,2393.3
65,32,33.5,0,1,0,2923
66,23,32,0,0,0,2736.8
67,31,25.5,0,1,0,2675.5
68,30,31.7,0,1,0,2744.9
69,36,30.8,0,1,0,2834.5
70,31,29.5,0,1,0,2728.7
71,36,29.9,1,1,0,2598
72,31,34.9,0,1,0,2850.4
73,32,26.8,1,1,0,2522.2
74,31,37.3,0,0,0,2849.4
75,35,29.9,0,1,0,2806.7
76,35,25.7,0,0,0,2735.2
77,37,26.1,0,0,0,2757.3
78,18,28.5,0,0,1,2411.8
79,31,32.9,0,1,0,2874.3
80,35,30,1,0,0,2667.4
81,37,30,0,0,0,2931.9
82,29,28.5,0,1,0,2706.2
83,28,27.2,0,0,0,2648.3
84,31,21.6,0,1,0,2560.9
85,28,34.1,0,0,0,2788.3
86,32,27.6,0,0,0,2765.3
87,34,33.8,1,0,0,2696.3
88,33,29.4,1,0,0,2644.8
89,31,38.7,1,1,0,2697.9
90,33,31.9,0,0,0,2803.7
91,42,31.3,0,1,0,2976.3
92,27,21.7,0,1,1,2423.4
93,31,29.6,0,0,0,2743.7
94,27,31,0,0,0,2737.3
95,32,32.7,0,0,0,2881.2
96,32,22.9,0,1,0,2644.7
97,42,25,0,0,0,2881.7
98,32,34.9,0,1,0,2895.4
99,36,27.3,1,1,0,2639.4
100,38,24.9,1,1,0,2640
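With records laid out like the listing above, one possible way to compare mean birthweight between male and female babies is Welch's unequal-variance t-test. The sketch below is only an illustration: the choice of Welch's test over a pooled-variance test is an assumption, the function name `welch_t` is hypothetical, and the sample is just the first twelve records of the listing rather than the full data set. It computes the test statistic and degrees of freedom with the standard library; a p-value would then come from the t distribution (for example from tables, or from `scipy.stats.ttest_ind(a, b, equal_var=False)` if SciPy is available).

```python
from math import sqrt
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return (t statistic, Welch-Satterthwaite degrees of freedom)."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Sample variances (n - 1 denominator) scaled by sample size.
    v_a = variance(sample_a) / n_a
    v_b = variance(sample_b) / n_b
    t_stat = (mean(sample_a) - mean(sample_b)) / sqrt(v_a + v_b)
    df = (v_a + v_b) ** 2 / (v_a ** 2 / (n_a - 1) + v_b ** 2 / (n_b - 1))
    return t_stat, df


# Illustrative subset only: (sex, birth weight in g) for IDs 1-12 of the listing.
subset = [
    (1, 2742.6), (1, 2577.4), (0, 2574.4), (1, 2752.0), (1, 2344.2), (1, 2550.9),
    (0, 2353.6), (1, 2696.5), (1, 2460.4), (1, 2514.6), (0, 2868.5), (1, 2549.5),
]
boys = [weight for sex, weight in subset if sex == 1]   # sex coded 1 = male
girls = [weight for sex, weight in subset if sex == 0]  # sex coded 0 = female

t_stat, df = welch_t(boys, girls)
print(f"mean boys = {mean(boys):.1f} g, mean girls = {mean(girls):.1f} g")
print(f"t = {t_stat:.3f}, df = {df:.1f}")
```

A full report would also state the null and alternative hypotheses, check the test's assumptions, and give the confidence interval for the difference in means.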
qattachments_962f3340b55a66a5c397ea5f64016d007c07acee

ID,maternal age,bmi,smoking status,sex,low birthweight,birth weight
1,23,26.6,1,0,1,2370.7
2,26,20.5,1,0,1,2150.4
3,29,30.7,1,1,1,2498.9
4,31,27.9,1,1,0,2504.3
5,29,35.7,1,0,0,2608.6
6,33,23.7,0,1,0,2691.1
7,25,27.1,0,0,0,2534.4
8,32,36.6,0,1,0,2875.9
9,37,33.7,0,0,0,2961.5
10,30,28.6,0,1,0,2658.5
11,27,32.1,0,1,0,2698.4
12,36,34.9,1,0,0,2785.4
13,34,29.6,0,0,0,2888.3
14,34,27.4,1,1,0,2671.2
15,28,29,0,1,0,2673.9
16,33,36,0,0,0,2833.2
17,37,22,0,1,0,2705
18,36,25.5,0,0,0,2803.8
19,31,21.9,0,0,0,2501
20,27,23.9,0,0,0,2539.2
21,38,28.7,0,1,0,2850
22,30,38.9,0,0,0,2818.2
23,29,31,0,0,0,2756.9
24,26,31.3,1,1,1,2457.7
25,32,26.2,1,1,1,2473.3
26,25,25.7,0,1,0,2611.7
27,39,24.4,0,1,0,2768.9
28,31,34.4,1,1,0,2651.2
29,22,29.1,1,1,1,2349.1
30,31,27.5,0,1,0,2674.7
31,29,29.2,0,1,0,2696.4
32,30,26.7,0,0,0,2620.5
33,30,28.2,1,1,1,2401.1
34,38,23,0,1,0,2827.7
35,31,33.2,0,1,0,2868.6
36,29,32.6,0,1,0,2724
37,29,28.9,1,0,0,2534.4
38,37,25.2,0,0,0,2680.5
39,29,21.3,0,0,0,2556.6
40,37,30,0,1,0,2815.1
41,29,25.7,0,0,0,2725.6
42,31,37.1,0,1,0,2774.1
43,22,33.3,0,0,0,2653.1
44,22,36.9,0,1,0,2772.2
45,20,23.5,1,0,1,2223.3
46,34,28.1,0,0,0,2737
47,26,29.2,1,0,1,2387.3
48,29,26.7,1,0,1,2476.6
49,23,35.1,1,0,0,2500.4
50,26,24.8,0,1,0,2577.3
51,27,28.6,0,0,0,2671.7
52,25,25.5,0,1,1,2479.7
53,33,27.4,0,1,0,2738.6
54,33,25.6,0,0,0,2706
55,25,29,0,1,0,2626.6
56,37,32.2,1,1,0,2689.1
57,27,31.3,0,1,0,2597.8
58,35,34,0,1,0,2988.5
59,31,30.8,0,0,0,2744.1
60,28,39.7,1,1,0,2786
61,33,35.2,0,1,0,2902
62,31,29.9,0,0,0,2804.4
63,25,27.5,0,0,1,2461.1
64,36,29.4,0,0,0,2923
65,31,26.3,0,1,0,2648.9
66,33,24.2,1,1,1,2448.7
67,28,25.6,1,0,1,2376.7
68,28,29.9,0,0,0,2695.2
69,280,25,0,1,0,2562.7
70,28,36.2,0,1,0,2797.7
71,26,29.6,0,1,0,2754.3
72,31,23.7,0,0,0,2620.3
73,31,36.4,0,0,0,2880.9
74,30,25.8,0,0,0,2702.8
75,28,27.5,0,0,0,2623.4
76,34,31.6,0,0,0,2834.5
77,29,26.2,1,0,1,2392.5
78,28,34.4,1,1,0,2567.8
79,30,34,0,1,0,2831.2
80,31,30.3,1,0,0,2560.9
81,34,27.8,0,0,0,2776.6
82,25,27.1,0,1,1,2491.1
83,25,31.6,0,1,0,2596.3
84,32,27.7,1,1,1,2498.1
85,34,30.9,0,1,0,2908.7
86,30,23,0,1,0,2607.6
87,34,32.3,1,0,0,2569
88,35,22.4,0,0,0,2668.9
89,34,24,0,0,0,2674.9
90,28,29.9,0,0,0,2626
91,30,36.9,1,1,0,2809.6
92,32,22.3,0,1,0,2608.2
93,20,21.7,1,0,1,2139.2
94,26,27.3,1,1,1,2353.9
95,27,2.5,0,0,0,2574.7
96,28,26.9,0,0,0,2614.6
97,28,24.9,0,0,0,2622.8
98,23,25.4,0,0,1,2498.3
99,35,25.7,0,1,0,2737.3
100,34,25.3,0,1,0,2642.6
101,26,20.6,1,1,1,2328.2
102,31,26.3,1,1,1,2418.7
103,35,19.9,1,1,1,2349.7
104,27,26.1,0,0,0,2635.6
105,30,29.9,0,0,0,2671
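Records like those above can be screened against the allowable ranges and codings given in the data description. The following is a minimal sketch of such a check, not a complete cleaning procedure: the function name `find_problems` and the dictionary layout are assumptions, and only three records from the listing (IDs 1, 69 and 95) are included to keep it short. It flags out-of-range values, invalid 0/1 codes, and a low birthweight flag that disagrees with the recorded weight.

```python
# Allowable ranges copied from the data description (maximum possible ranges).
VALID_RANGES = {
    "maternal age": (18, 42),
    "bmi": (19.4, 42.5),
    "birth weight": (2139.2, 3089.6),
}
BINARY_FIELDS = ("smoking status", "sex", "low birthweight")


def find_problems(record):
    """Return a list of human-readable problems found in one record."""
    problems = []
    for field, (low, high) in VALID_RANGES.items():
        if not low <= record[field] <= high:
            problems.append(f"{field} = {record[field]} is outside {low}-{high}")
    for field in BINARY_FIELDS:
        if record[field] not in (0, 1):
            problems.append(f"{field} = {record[field]} is not coded 0 or 1")
    # Consistency check: the flag should be 1 exactly when weight <= 2,500 g.
    expected_flag = 1 if record["birth weight"] <= 2500 else 0
    if record["low birthweight"] != expected_flag:
        problems.append("low birthweight flag disagrees with birth weight")
    return problems


# Three records taken from the listing above (IDs 1, 69 and 95).
records = [
    {"ID": 1, "maternal age": 23, "bmi": 26.6, "smoking status": 1,
     "sex": 0, "low birthweight": 1, "birth weight": 2370.7},
    {"ID": 69, "maternal age": 280, "bmi": 25.0, "smoking status": 0,
     "sex": 1, "low birthweight": 0, "birth weight": 2562.7},
    {"ID": 95, "maternal age": 27, "bmi": 2.5, "smoking status": 0,
     "sex": 0, "low birthweight": 0, "birth weight": 2574.7},
]

for record in records:
    for problem in find_problems(record):
        print(f"ID {record['ID']}: {problem}")
```

What to do with a flagged value (correct it against the paper form, or set it to missing) is a judgement to be justified in the report; the check only locates candidates. Duplicate IDs and missing values would need separate checks.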
Questions

1. Data cleaning.

You have received data from a hypothetical study of birthweight and the associated characteristics of the baby and the mother. The data set consists of 200 records containing information on the baby’s sex and birthweight and the mother’s smoking status, body mass index (bmi) and age. A complete description of the data is in the document assignment 3 data description. You should read this document before starting your data analysis. The data are in the comma-separated file birthweights data cleaning.csv.

These data were collected on paper forms and transferred to a computer file before being given to you. Your task is to examine them for any invalid or inconsistent data and to prepare the data set for analysis. Report on your findings: what data were incorrect and what you did with the incorrect data. (5 marks)

2. The relationship between baby’s birthweight and the baby’s sex and the mother’s smoking status.

You receive a second data set from the same hypothetical study of birthweight and the associated characteristics of the baby and the mother as described in question 1. However, these data are in a computer file and already prepared for analysis. The document assignment 3 data description also describes this data set. The data are in the comma-separated file birthweights analysis.csv.

You wish to investigate the relationship between the baby’s birthweight and the baby’s sex and the mother’s smoking status.

2.1 The relationship between baby’s sex and birthweight.

Carry out a hypothesis test to see if mean birthweight is different for boy babies compared to girl babies, fully reporting on the test and its results. What do you conclude? (7 marks)

2.2 The relationship between low birthweight babies and mother’s smoking status.

You wish to investigate the attributable fraction for risk of low birthweight and mother’s smoking status, so you will need the risk difference.

Calculate an estimate of the risk difference and its 95% confidence interval for the risk of low birthweight between mothers who are current smokers and mothers who are not current smokers. State these as a percentage to 1 decimal place. Test the hypothesis that this risk difference is zero, fully reporting on the test and its results. What do you conclude? (7 marks)

HSH746 Biostatistics 1 Assignment 3
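For question 2.2, a risk difference with a Wald 95% confidence interval and a two-sided z-test of zero difference could be computed along the following lines. This is a sketch under stated assumptions: the Wald interval and the pooled-proportion z-test are one common choice rather than a method prescribed by the assignment, the function name `risk_difference` is hypothetical, and the counts in the example are invented, not taken from the data.

```python
from math import sqrt
from statistics import NormalDist


def risk_difference(events_exposed, n_exposed, events_unexposed, n_unexposed,
                    level=0.95):
    """Return (risk difference, (ci_low, ci_high), z statistic, two-sided p-value)."""
    p1 = events_exposed / n_exposed        # risk among smokers
    p0 = events_unexposed / n_unexposed    # risk among non-smokers
    rd = p1 - p0

    # Wald interval: unpooled standard error of the difference in proportions.
    z_crit = NormalDist().inv_cdf(1 - (1 - level) / 2)
    se = sqrt(p1 * (1 - p1) / n_exposed + p0 * (1 - p0) / n_unexposed)
    ci = (rd - z_crit * se, rd + z_crit * se)

    # z-test of H0: risk difference = 0, using the pooled proportion under H0.
    pooled = (events_exposed + events_unexposed) / (n_exposed + n_unexposed)
    se_null = sqrt(pooled * (1 - pooled) * (1 / n_exposed + 1 / n_unexposed))
    if se_null == 0:
        raise ValueError("test undefined when no one or everyone has the outcome")
    z_stat = rd / se_null
    p_value = 2 * (1 - NormalDist().cdf(abs(z_stat)))
    return rd, ci, z_stat, p_value


# Invented counts for illustration: 12 low birthweight babies among 30 smokers,
# 5 among 70 non-smokers. Replace with counts tabulated from the analysis file.
rd, (ci_low, ci_high), z_stat, p_value = risk_difference(12, 30, 5, 70)
print(f"risk difference = {100 * rd:.1f}% "
      f"(95% CI {100 * ci_low:.1f}% to {100 * ci_high:.1f}%)")
print(f"z = {z_stat:.2f}, p = {p_value:.4f}")
```

The counts passed to the function come from a 2x2 table of smoking status by low birthweight. With small cell counts the normal approximation may be poor, in which case an exact method would be worth considering.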
