Zur Kurzanzeige

dc.date.accessioned2022-01-21T13:53:00Z
dc.date.available2022-01-21T13:53:00Z
dc.date.issued2021-04-19
dc.identifierdoi:10.17170/kobra-202201045355
dc.identifier.urihttp://hdl.handle.net/123456789/13539
dc.description.sponsorshipGefördert im Rahmen eines Open-Access-Transformationsvertrags mit dem Verlagger
dc.language.isoengeng
dc.rightsNamensnennung 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectcareless respondingeng
dc.subjectgradient boosted treeseng
dc.subjectdata cleaningeng
dc.subjectresponse timeseng
dc.subjectoutlier detectioneng
dc.subject.ddc150
dc.titleDetecting Careless Responding in Survey Data Using Stochastic Gradient Boostingeng
dc.typeAufsatz
dcterms.abstractCareless responding is a bias in survey responses that disregards the actual item content, constituting a threat to the factor structure, reliability, and validity of psychological measurements. Different approaches have been proposed to detect aberrant responses such as probing questions that directly assess test-taking behavior (e.g., bogus items), auxiliary or paradata (e.g., response times), or data-driven statistical techniques (e.g., Mahalanobis distance). In the present study, gradient boosted trees, a state-of-the-art machine learning technique, are introduced to identify careless respondents. The performance of the approach was compared with established techniques previously described in the literature (e.g., statistical outlier methods, consistency analyses, and response pattern functions) using simulated data and empirical data from a web-based study, in which diligent versus careless response behavior was experimentally induced. In the simulation study, gradient boosting machines outperformed traditional detection mechanisms in flagging aberrant responses. However, this advantage did not transfer to the empirical study. In terms of precision, the results of both traditional and the novel detection mechanisms were unsatisfactory, although the latter incorporated response times as additional information. The comparison between the results of the simulation and the online study showed that responses in real-world settings seem to be much more erratic than can be expected from the simulation studies. We critically discuss the generalizability of currently available detection methods and provide an outlook on future research on the detection of aberrant response patterns in survey research.eng
dcterms.accessRightsopen access
dcterms.creatorSchroeders, Ulrich
dcterms.creatorSchmidt, Christoph
dcterms.creatorGnambs, Timo
dc.relation.doidoi:10.1177/00131644211004708
dc.subject.swdUmfrageger
dc.subject.swdDatenger
dc.subject.swdDatenauswertungger
dc.subject.swdBiasger
dc.subject.swdAusreißer <Statistik>ger
dc.subject.swdFehlererkennungger
dc.type.versionpublishedVersion
dcterms.source.identifiereissn:1552-3888
dcterms.source.issueIssue 1
dcterms.source.journalEducational and Psychological Measurement (EPM)eng
dcterms.source.pageinfo29-56
dcterms.source.volumeVol. 82
kup.iskupfalse


Dateien zu dieser Ressource

Thumbnail
Thumbnail

Das Dokument erscheint in:

Zur Kurzanzeige

Namensnennung 4.0 International
Solange nicht anders angezeigt, wird die Lizenz wie folgt beschrieben: Namensnennung 4.0 International