Date of Award
Winter 3-1-2025
Document Type
Dissertation
Degree Name
Doctor of Philosophy (PhD)
Department
Psychology
First Advisor
Tilman Sheets
Abstract
This paper examined and compared several natural language processing and machine learning techniques in predicting self-reported Big Five personality traits from text responses. The models were validated on the open-source 2019 SIOP Machine Learning Competition dataset (N = 1,689). The techniques evaluated included bag-of-words, Empath dictionary, LSTM networks, fine-tuning Transformer models, and stacked generalization. Results indicated that the present study’s models had lower error in four of the five constructs analyzed. Limitations of the study include use of an MTurk sample and small sample size. Future research should explore similar techniques on larger applicant samples. Practical implications and contributions to the literature are also discussed.
Recommended Citation
Meyer, Joseph Uran, "" (2025). Dissertation. 1039.
https://digitalcommons.latech.edu/dissertations/1039