What is structured interviewing?
Structured interviewing means asking every candidate the same questions, in the same order, evaluated against the same criteria. It's the most consistently validated method for predicting job performance across decades of industrial-organizational psychology research.
The opposite β unstructured interviewing β is what most hiring teams default to: different questions for different candidates, evaluation based on overall impression rather than specific criteria, and decisions heavily influenced by rapport, shared background, and other factors that don't predict performance.
Structured screening extends this principle to the resume review stage. Before you ever speak to a candidate, you should know exactly what you're looking for and how you'll determine whether they have it.
Why structured screening outperforms unstructured review
Research consistently finds that unstructured evaluation methods β reading resumes and forming gut-level impressions β are among the least reliable ways to predict job performance. The problems:
- Inconsistency across reviewers. Two interviewers reading the same resume often reach different conclusions. Without shared criteria, there's no way to resolve these differences.
- Inconsistency across candidates. The same reviewer judges candidate five differently than candidate one. Fatigue, contrast effects, and unconscious bias accumulate.
- Bias amplification.Without structure, reviewers default to heuristics: prestige of past employers, quality of resume formatting, perceived likability. These factors have near-zero correlation with on-the-job success.
Structured screening doesn't eliminate these problems β no process can β but it dramatically reduces them by making evaluation criteria explicit and consistent.
How to design effective yes/no screening questions
A good yes/no screening question has four properties:
- Binary. The answer is genuinely yes or no. βDoes the candidate have 3+ years of Python experience?β is binary. βIs the candidate a strong engineer?β is not β it requires interpretation.
- Observable. You can determine the answer from the candidate's resume, LinkedIn, or portfolio. βIs the candidate a good culture fit?β is not observable from a resume.
- Job-relevant. The question tests something that predicts success in this specific role. βDoes the candidate have a computer science degree?β may or may not be relevant depending on the role.
- Anchored. Both βyesβ and βnoβ have clear, written definitions. Reviewers shouldn't have to interpret what counts as a yes.
Questions that predict performance vs. questions that don't
Not all questions are created equal. Research on structured interviewing has identified which question types actually predict job performance and which are noise:
Questions that work
- Past behavior questions (βTell me about a time you...β) β 0.51 correlation with performance
- Job knowledge and skill verification β 0.48 correlation
- Work sample tests and practical exercises β 0.54 correlation
- Structured, anchored rating scales for all of the above
Questions that don't work
- Brain teasers (βHow many golf balls fit in a 747?β) β near-zero predictive validity
- Hypothetical scenarios without structured scoring β low validity
- Questions about strengths and weaknesses β candidates deliver rehearsed answers
- Unstructured βget to know youβ conversation β not predictive, high bias risk
Applying screening questions across your pipeline
Having good questions is step one. Applying them consistently is where most teams fall short. Here's a practical framework:
- Define questions before seeing candidates. Write your screening questions before you look at any resumes. This prevents you from tailoring questions to favor specific candidates you've already seen.
- Assign questions to stages. Must-have criteria go in the initial screen. Nice-to-haves and deeper skill assessment belong in later stages.
- Document every answer.For each candidate and each question, record the answer and the evidence. βYes β lists 4 years of React experience at Acme Corp and built their component library.β
- Review decisions as a team. Periodically audit who advanced and who didn't. Are the criteria being applied consistently? Are there patterns in who gets screened out?