Date of Award

Winter 3-1-2025

Document Type

Dissertation

Degree Name

Doctor of Philosophy (PhD)

Department

Psychology

First Advisor

Tilman Sheets

Abstract

This paper examined and compared several natural language processing and machine learning techniques in predicting self-reported Big Five personality traits from text responses. The models were validated on the open-source 2019 SIOP Machine Learning Competition dataset (N = 1,689). The techniques evaluated included bag-of-words, Empath dictionary, LSTM networks, fine-tuning Transformer models, and stacked generalization. Results indicated that the present study’s models had lower error in four of the five constructs analyzed. Limitations of the study include use of an MTurk sample and small sample size. Future research should explore similar techniques on larger applicant samples. Practical implications and contributions to the literature are also discussed.

Share

COinS