Personality test hiring accuracy is a topic that divides HR professionals and job seekers alike — and for good reason. Most people instinctively wonder: if candidates can simply “fake good” on a pre-employment personality test, does the whole exercise even matter? Surprisingly, emerging research suggests the answer is more nuanced than a flat “no.” Even when applicants inflate their scores, personality assessments can still offer meaningful predictions about how someone will behave on the job months down the line.
A study published in the Journal of Research in Personality (2018), conducted by researchers from Deakin University in Australia and Singapore Management University, tracked 607 participants over approximately 18 months. By comparing 260 genuine job applicants against 347 non-applicants, the research team was able to directly test whether the well-known problem of job applicant faking good truly destroys the usefulness of personality data in employee selection. What they found challenges some long-held assumptions — and has practical implications for both hiring managers and candidates.
Once again, personality researcher and author of Villain Encyclopedia, Tokiwa (@etokiwa999), will provide the explanation.
※We have developed the HEXACO-JP Personality Assessment! It has more scientific basis than MBTI. Tap below for details.

目次
- 1 Why Job Applicants Inflate Their Personality Scores — and By How Much
- 2 Personality Test Hiring Accuracy for Predicting Positive Workplace Behavior
- 3 Can Pre-Employment Personality Testing Predict Counterproductive Work Behavior?
- 4 How Narrower Personality Facets Can Improve Prediction Accuracy
- 5 Practical Takeaways: What This Means for Candidates and Hiring Professionals
- 6 Frequently Asked Questions
- 6.1 Does faking on a personality test during a job application really affect the results?
- 6.2 How accurately can a pre-employment personality test predict future job behavior?
- 6.3 Which personality trait is the most important predictor of workplace performance?
- 6.4 Should job seekers answer personality tests honestly or try to present their best self?
- 6.5 Can using more detailed personality facets improve hiring assessment accuracy?
- 6.6 Is personality testing alone sufficient for making hiring decisions?
- 6.7 Does the range restriction caused by faking make personality tests statistically useless in hiring?
- 7 Summary: Personality Test Hiring Accuracy Is Imperfect — But Far From Worthless
Why Job Applicants Inflate Their Personality Scores — and By How Much
Research consistently shows that job applicants tend to present a more socially desirable version of themselves on personality tests — but the degree of inflation is more moderate than many assume. Think about it from a candidate’s perspective: if you were completing an assessment that could determine whether you get the job, wouldn’t you be tempted to lean toward the “better” answer? This tendency is called faking good, or socially desirable responding, and it is one of the most debated challenges in pre-employment personality testing.
In this study, applicants scored meaningfully higher than non-applicants on 3 key personality dimensions. The differences were not trivial — effect sizes ranged from approximately 0.6 to 0.9 standard deviations, placing them in the medium-to-large range. In plain terms, that’s a noticeable but not dramatic shift, comparable to scoring slightly higher on a test but not topping every category. Crucially, the inflation was not random. It was concentrated in traits that employers visibly value:
- Conscientiousness — reflecting diligence, reliability, and rule-following behavior
- Agreeableness — reflecting warmth, cooperation, and consideration for others
- Extraversion — reflecting sociability, energy, and confidence in social settings
Notably, traits like emotionality (neuroticism) and openness to experience showed little to no elevation, suggesting that candidates are selectively boosting the dimensions they perceive as professionally desirable rather than inflating everything indiscriminately. This pattern indicates that faking good is real and directional, but not so extreme that it turns someone into a completely different person on paper.
Score Compression: When Everyone Looks the Same
One underappreciated side effect of faking good is that the range of scores shrinks — a phenomenon sometimes called score compression or range restriction. In the study, the spread of scores among applicants was approximately 80% of what it was among non-applicants. Imagine a class where most students score between 70 and 100, compared to a class where scores range from 40 to 100 — the first class is harder to rank because everyone clusters together. This compression makes it statistically harder for personality scores to correlate with future behavior, even if the underlying relationship is real. However, the fundamental structure of personality — the way different traits relate to each other — remained largely intact among applicants. The personality “map” was compressed but not fundamentally distorted.
Does the Core Personality Signal Survive Over Time?
Even after approximately 18 months, personality scores showed meaningful stability in both groups — though slightly less so for applicants. Among non-applicants, the correlation between initial personality scores and scores measured later was around 0.68, a strong figure indicating high consistency. For applicants, this dropped to about 0.54 — lower, but far from zero. This finding suggests that even when someone inflates their answers at the point of application, their underlying personality profile does not disappear entirely. A little performance during testing cannot fully mask who a person actually is.
Personality Test Hiring Accuracy for Predicting Positive Workplace Behavior
One of the most important findings is that personality scores collected during the hiring process still predicted constructive workplace behavior roughly 18 months later, even among applicants who had inflated their answers. This type of behavior — helping a struggling colleague without being asked, picking up litter in common areas, volunteering for tasks outside one’s job description — is what researchers call Organizational Citizenship Behavior, or OCB. It is the kind of discretionary effort that makes teams function well but rarely appears in a formal job description.
OCB was measured using 16 items rated on a 7-point scale, capturing both interpersonal helpfulness and general organizational loyalty. The correlations between personality scores and later OCB ranged from approximately 0.17 to 0.39 — with 0.30 generally considered a moderate and practically meaningful effect in behavioral prediction. Together, the 6 major HEXACO personality dimensions explained about 21% of the variance in OCB among non-applicants, and a somewhat lower but still notable proportion among applicants.
Which Personality Traits Were the Strongest Predictors of Good Behavior?
Conscientiousness and extraversion emerged as the 2 most consistent predictors of OCB, with agreeableness also playing a notable role. Conscientiousness — defined as the tendency to be organized, dependable, and rule-abiding — predicts whether someone will go the extra mile not because they are told to, but because it aligns with their values. Extraversion, which reflects sociability and enthusiasm, relates to OCB because people-oriented individuals naturally tend toward collaborative, helping behaviors. Agreeableness contributes through its core of empathy and concern for others. Emotionality and openness to experience, by contrast, showed weaker and less consistent links to OCB, indicating that not all personality dimensions are equally relevant to cooperative workplace conduct.
How Much Did Faking Good Reduce the Predictive Power?
How Much Did Faking Good Reduce the Predictive Power?
The reduction in predictive accuracy was real but modest — not the catastrophic collapse that critics of personality testing often assume. The average difference in personality-OCB correlations between applicants and non-applicants was only about 0.052 — a small gap. While the variance explained in OCB dropped somewhat in the applicant group, the change was not statistically decisive in all comparisons. Research suggests that predictive power may decline by roughly one-third in an applicant context, but a meaningful signal — accounting for more than 10% of behavioral variance — remained intact. For those who assumed that faking renders personality tests completely useless, the evidence says otherwise.
Can Pre-Employment Personality Testing Predict Counterproductive Work Behavior?
Beyond positive behaviors, the study also examined whether personality scores could predict counterproductive work behavior (CWB) — and found that they could, even among applicants. CWB is the mirror image of OCB: it refers to intentional actions that harm the organization or its members. Examples include deliberately slacking off, sabotaging colleagues’ work, taking company property without permission, or engaging in workplace harassment. These behaviors are costly for organizations, both financially and culturally, making their prediction during hiring especially valuable.
CWB was assessed using 19 items rated on a 7-point frequency scale. The personality-CWB correlations told a clear story, with conscientiousness emerging as the single strongest predictor across both groups.
Conscientiousness as the Key Workplace Risk Indicator
Among all personality dimensions measured, conscientiousness showed the most robust negative relationship with counterproductive work behavior — meaning that individuals who scored lower on conscientiousness tended to engage in more harmful workplace conduct over the following 18 months. The correlation was approximately −0.30, and conscientiousness alone accounted for around 18% of the variance in CWB among non-applicants. This is a clinically and practically meaningful figure in behavioral science. Conscientiousness is broadly defined as the tendency to think carefully before acting, follow rules, and avoid dishonest shortcuts. It is, in essence, a psychological brake on impulsive or harmful conduct. Agreeableness also contributed negatively to CWB prediction, while emotionality and openness showed weaker links.
Did the Applicant Context Weaken CWB Prediction?
The CWB prediction held up reasonably well in the applicant group, though with some weakening. Among applicants, personality explained approximately 11% of variance in CWB, compared to 18% among non-applicants — a reduction of roughly one-third, similar to the OCB pattern. However, the statistical difference between the 2 groups was not always decisive, meaning researchers were cautious about making firm conclusions. One intriguing sub-finding was that the pattern of which traits mattered for CWB differed slightly between applicants and non-applicants — for instance, conscientiousness appeared relatively more dominant in the applicant group, while extraversion played less of a role. However, when age and gender were factored into the analysis, these apparent differences largely faded, suggesting they may not represent genuine structural differences in how personality predicts behavior.
Why Age and Gender Matter for Interpreting the Results
Controlling for demographic variables revealed that much of the apparent difference between applicants and non-applicants in CWB prediction was linked to background factors rather than personality inflation alone. Research suggests that older employees tend to engage in fewer counterproductive behaviors, and male employees show a slightly higher tendency toward CWB on average. When these variables were statistically accounted for, the gap between applicant and non-applicant groups in personality-CWB relationships narrowed further. This finding underscores the importance of sophisticated analysis in employee selection assessment — surface-level comparisons can be misleading without accounting for who is in each group.
How Narrower Personality Facets Can Improve Prediction Accuracy
Beyond the 6 broad personality dimensions, the study also explored whether examining 25 narrower personality facets could sharpen predictive accuracy — and found evidence that it can. Think of broad personality traits like high-resolution terrain maps that give you the general landscape, while narrow facets are like zooming into specific neighborhoods. You get more detail, more nuance, and potentially more useful information for specific decisions.
When only the 6 broad HEXACO dimensions were used, they explained about 18% of CWB variance and 21% of OCB variance among non-applicants. When 25 narrower facets were added to the model, the explained variance for OCB climbed to approximately 32% — an 11-percentage-point increase — and CWB variance explained rose to around 21%. Even among applicants, adding facets improved prediction, though the gains were somewhat smaller. In practical terms, using more granular personality data could meaningfully strengthen the usefulness of pre-employment personality testing.
Which Facets Added the Most Value?
Not all 25 facets contributed equally — some provided meaningful incremental prediction while others added little beyond the broad dimensions. Among non-applicants, several specific facets showed statistically significant links to OCB and CWB above and beyond the broad traits. However, among applicants, fewer facets reached statistical significance, and the set of predictive facets did not always match between the 2 groups. This inconsistency is a caution worth noting: a facet that predicts behavior in a low-stakes research setting may not show the same relationship when measured under the pressure of a job application. The practical implication is that facet-level analysis is promising but should not be applied blindly across all contexts without further validation.
Limitations to Keep in Mind When Evaluating These Findings
Like all research, this study has important limitations that shape how confidently we can apply its conclusions. First, both personality and workplace behavior were measured via self-report — meaning participants described their own actions rather than having an objective observer or organizational record to verify them. This introduces the possibility of social desirability bias at the outcome measurement stage as well. Second, while 607 participants across approximately 18 months is a creditable longitudinal design, larger and more diverse samples would strengthen the generalizability of findings. Third, the study was conducted in a specific cultural and economic context; results may differ across industries, countries, or organizational cultures. Despite these caveats, the time-lagged design — measuring personality now and behavior 18 months later — represents a methodologically rigorous approach that is more demanding than simple cross-sectional snapshots.
Practical Takeaways: What This Means for Candidates and Hiring Professionals
Understanding the science behind personality test validity can help both job seekers and HR professionals make smarter, more grounded decisions. Here is what the evidence suggests for each group:
For Job Candidates: Authenticity Has a Strategic Edge
Extreme faking is both unnecessary and risky. Research indicates that candidates naturally inflate socially desirable traits to a moderate degree — and this is largely unavoidable. However, inflating answers dramatically enough to misrepresent your actual character creates a different problem: if your personality scores suggest high conscientiousness and strong agreeableness, but your behavior over the following year tells a different story, the mismatch can damage trust and career prospects. More practically, the job you “fake your way into” may be a poor fit for who you actually are — leading to dissatisfaction and turnover. Honest, reflective answers tend to place you in environments where you are genuinely more likely to thrive. Think of a personality assessment not as a test to ace, but as a matching tool to use wisely.
For HR Professionals: Don’t Dismiss, But Don’t Overrely
Pre-employment personality testing retains meaningful predictive value even when faking is factored in — but it works best as part of a broader assessment strategy. Here are 4 evidence-informed principles for HR practitioners:
- Prioritize conscientiousness — it is the most consistent predictor of both productive and counterproductive workplace behavior across applicant and non-applicant groups. A candidate’s conscientiousness score, even if mildly inflated, still carries signal.
- Consider facet-level data — broad trait scores give a useful overview, but drilling down into narrower personality facets can increase explained variance by approximately one-third. Work with assessment providers who offer facet-level reporting.
- Pair assessments with structured interviews — personality tests explain 10–21% of behavioral variance, leaving a large portion unexplained. Structured behavioral interviews, work samples, and reference checks add complementary information that personality scores alone cannot provide.
- Account for demographic context — age and gender interact with personality in predicting workplace behavior. Avoid comparing raw personality scores across groups without acknowledging these background factors.
The goal is not to achieve perfect prediction — which no single tool can do — but to incrementally improve the probability of good hiring decisions. Even a modest improvement in predictive accuracy across hundreds of hiring decisions each year can translate into measurable organizational benefits.
Frequently Asked Questions
Does faking on a personality test during a job application really affect the results?
Yes, research shows that job applicants tend to score higher on socially desirable traits — particularly conscientiousness, agreeableness, and extraversion — compared to people taking the same test in a low-stakes setting. The effect sizes range from medium to large (approximately 0.6 to 0.9 standard deviations). However, faking does not completely erase the genuine personality signal. The core structure of an individual’s personality tends to remain detectable even after moderate inflation, and predictive validity — though reduced by roughly one-third — is not eliminated.
How accurately can a pre-employment personality test predict future job behavior?
Research suggests that personality assessments administered during hiring can explain approximately 10–21% of variance in workplace behaviors measured 18 months later, even among applicants who have inflated their answers. While this leaves a large portion of behavior unexplained, a 10–21% predictive contribution is considered meaningful in behavioral science. Conscientiousness is the strongest individual predictor, especially for counterproductive work behaviors. Combining personality data with other selection tools substantially improves overall prediction accuracy.
Which personality trait is the most important predictor of workplace performance?
Among the traits studied, conscientiousness is consistently the strongest and most reliable predictor of workplace behavior — both in predicting constructive behaviors like helping colleagues and in predicting counterproductive behaviors like shirking duties or workplace misconduct. Research indicates that low conscientiousness correlates with higher rates of counterproductive work behavior (correlation approximately −0.30), while high conscientiousness is linked to greater organizational citizenship behavior. This pattern holds across both applicant and non-applicant groups, making it the single most valuable trait to assess in hiring contexts.
Should job seekers answer personality tests honestly or try to present their best self?
Research suggests that a moderate degree of positive self-presentation is natural and nearly universal among applicants — and organizations already account for this. Extreme faking, however, carries risks: it may place a candidate in a role that is a poor personality fit, increasing the likelihood of dissatisfaction and early turnover. More importantly, personality assessments tend to retain enough of the underlying signal to detect broad patterns. Authentic, thoughtful responses are likely to result in better job-person matching, which benefits both the candidate and the employer over the long term.
Can using more detailed personality facets improve hiring assessment accuracy?
Yes — research indicates that moving beyond 6 broad personality dimensions to assess 25 narrower facets can increase explained variance in workplace behavior by approximately one-third. For example, OCB variance explained rose from around 21% to approximately 32% when facets were added. However, the specific facets that predict behavior in low-stakes research settings do not always replicate cleanly in an applicant context, so facet-level findings should be interpreted with caution and ideally validated within a specific organizational setting before being used as primary decision criteria.
Is personality testing alone sufficient for making hiring decisions?
No — personality test validity, while real, is limited. Even under the most favorable conditions, personality data explains roughly 10–32% of variance in workplace behavior, meaning the majority of what drives on-the-job performance lies outside what personality tests capture. Factors like specific skills, cognitive ability, cultural fit, work experience, and situational variables all contribute. Best-practice employee selection assessment combines personality measures with structured behavioral interviews, cognitive ability tests, work samples, and thorough reference checks to build a more complete and accurate picture of each candidate.
Does the range restriction caused by faking make personality tests statistically useless in hiring?
Not entirely. While score compression — where applicant scores cluster together — does reduce the statistical power of personality-behavior correlations, it does not nullify them. Research shows that even after accounting for range restriction, personality scores collected in an applicant context still show meaningful correlations with behavior observed 18 months later. The predictive reduction is estimated at roughly one-third compared to a low-stakes context, but that still leaves a statistically and practically meaningful signal for HR professionals to work with, particularly when combined with other selection tools.
Summary: Personality Test Hiring Accuracy Is Imperfect — But Far From Worthless
The key takeaway from this research is that personality test hiring accuracy is real, resilient, and underestimated. Yes, job applicants inflate their scores — particularly on conscientiousness, agreeableness, and extraversion. Yes, this compression reduces predictive power by an estimated one-third. But it does not eliminate the signal. Personality data collected during the hiring process can still explain meaningful proportions of both positive workplace behaviors (OCB) and harmful ones (CWB) up to 18 months later. Conscientiousness, in particular, stands out as a robust, cross-context predictor of how people actually conduct themselves at work.
For HR professionals, the message is clear: treat personality assessments as one valuable layer in a multi-tool selection strategy rather than either a definitive oracle or a pointless formality. For job seekers, there is genuine value in engaging honestly with these assessments — not just ethically, but strategically, since authentic answers are more likely to land you in a role where your real strengths can shine. Now that you understand what personality tests can and cannot tell us about future workplace behavior, take a closer look at your own profile — and consider which of your genuine traits are your greatest professional assets.
