Inferring bivariate associations with continuous data from studies using respondent-driven sampling.

Journal: Journal of the Royal Statistical Society. Series C, Applied statistics

Volume: 74

Issue: 2

Year of Publication: 

Affiliated Institutions:  Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA. Section of Infectious Diseases, Boston University School of Medicine, Boston, MA, USA. Alcohol, Tobacco and Other Drug Research Unit, South African Medical Research Council, Cape Town, South Africa. Department of Mathematics and Statistics, McGill University, Montreal, Quebec, Canada. Department of Mathematics and Statistics, University of Massachusetts, Amherst, MA, USA.

Abstract summary 

Respondent-driven sampling (RDS) is a link-tracing sampling design that was developed to sample from hidden populations. Although associations between variables are of great interest in epidemiological research, there has been little statistical work on inference on relationships between variables collected through RDS. The link-tracing design, combined with homophily, the tendency for people to connect to others with whom they share characteristics, induces similarity between linked individuals. This dependence inflates the Type 1 error of conventional statistical methods (e.g. -tests, regression, etc.). A semiparametric randomization test for bivariate association was developed to test for association between two categorical variables. We directly extend this work and propose a semiparametric randomization test for relationships between two variables, when one or both are continuous. We apply our method to variables that are important for understanding tuberculosis epidemiology among people who smoke illicit drugs in Worcester, South Africa.

Authors & Co-authors:  Malatesta Samantha S Jacobson Karen R KR Carney Tara T Kolaczyk Eric D ED Gile Krista J KJ White Laura F LF

Study Outcome 

Source Link: Visit source

Statistics
Citations :  Baraff, A. J., McCormick, T. H., & Raftery, A. E. (2016). Estimating uncertainty in respondent-driven sampling using a tree bootstrap method. Proceedings of the National Academy of Sciences of the United States of America, 113(51), 14668–14673. 10.1073/pnas.1617258113
Authors :  6
Identifiers
Doi : 10.1093/jrsssc/qlae061
SSN : 0035-9254
Study Population
Male,Female
Mesh Terms
Other Terms
bivariate association;continuous data;homophily;network;randomization test;respondent-driven sampling
Study Design
Study Approach
Country of Study
South Africa
Publication Country
England