Numerical Reasoning

Updated 1 July 2026

What is a good numerical reasoning test score?

Securing a position on a competitive UK graduate scheme or a US summer-analyst program requires clearing early-stage psychometric hurdles. For most corporate roles, the numerical reasoning assessment serves as a high-volume filter. Candidates often wonder what exact score guarantees progression to the assessment centre or superday. Because corporate employers evaluate these metrics relative to applicant pools rather than as raw percentages, understanding how your performance is measured is essential to structuring your preparation effectively.

70th Percentile

Typical corporate pass mark

Varies by employer

85th Percentile

Elite finance and consulting threshold

Approximate baseline

12 to 15

Average question count on adaptive tests

Typically timed

60 to 90 seconds

Average time allocation per question

Strategy dependent

Quick answer

A good numerical reasoning test score is typically at or above the 70th percentile, meaning you outperform 70 percent of the reference norm group. For highly competitive fields like investment banking or management consulting, the employer-set cut-off often rises to the 80th or 85th percentile.

Key points

  • Employers evaluate your performance using percentiles relative to a peer norm group rather than a raw percentage of correct answers.
  • A score in the 70th percentile or higher is generally safe for corporate graduate programs, while elite firms require the 80th to 85th percentile.
  • The exact passing threshold fluctuates based on the overall volume and quality of applications received during that specific hiring cycle.
  • Candidates rarely receive their raw scores or percentile rankings; you only discover if you passed by progressing to the next round.

Percentiles Versus Raw Percentage Scores

To understand your performance, you must separate your raw score from your percentile rank. A raw score is simply the number of questions answered correctly divided by the total number of items. In school or university maths assessments, a 70 percent raw score is objectively good. However, on a psychometric assessment, a raw score of 70 percent can translate to a poor percentile or an excellent one depending entirely on the difficulty of the test variation and how your specific peer group performed.

Employers utilize norm groups to put your score into context. A norm group consists of a representative sample of individuals with similar backgrounds, such as global graduates or engineering professionals. If an assessment is exceptionally difficult, answering only 50 percent of the questions correctly might place you in the 75th percentile, which would clear most corporate benchmarks. Conversely, on an easier assessment, an 80 percent raw score might only land you in the 55th percentile, resulting in an automatic rejection.

Industry Benchmarks for UK and US Programs

Passing standards are entirely employer-set and vary widely by sector. For general corporate graduate schemes or rotational new-grad programs in sectors like retail, logistics, or public sector advisory, the cut-off benchmark is commonly reported as around the 60th to 70th percentile. In these tracks, employers use the assessment as a basic baseline check to confirm you possess the foundational numerical literacy required to manage budgets and analyze basic spreadsheets.

The threshold changes significantly when applying for high-compensation roles. Elite management consultancies and global investment banks regularly set their automated filters at the 80th or 85th percentile. Because these firms receive tens of thousands of applications for a limited number of summer-analyst or analyst seats, they use high numerical thresholds to reduce the applicant pool to a manageable size before human recruiters ever look at a CV or resume.

Profiling the Major Psychometric Test Providers

Different test publishers utilize unique scoring structures and presentation formats. Understanding which provider your target employer utilizes will allow you to adjust your pacing and accuracy strategies.

SHL Verify Assessments

SHL is one of the most common providers globally. Their standard Verify interactive or multiple-choice tests evaluate your ability to interpret charts, trends, and financial statements. They compare your performance against specific industry norm groups, and their reporting system highlights your accuracy alongside your total speed.

Korn Ferry and Talent Q

Talent Q assessments, managed by Korn Ferry, are explicitly adaptive. The system adjusts the difficulty of each consecutive question based on whether your previous answer was correct. A higher difficulty level yields a higher potential score ceiling, meaning you cannot achieve a top percentile if you only answer easy questions.

Cappfinity Numerical Reasoning

Cappfinity focuses on a blend of speed, accuracy, and core capability. Unlike traditional timed tests where you run out of time completely, some Cappfinity variants assess how long you take to answer questions without a hard cut-off per item, penalizing excessive deliberation while measuring your natural preferences.

Saville Assessment

Saville tests are known for intense time pressure, often splitting assessments into short subtests. A high score requires rapid processing of complex data structures. They provide employers with both a total aptitude score and specific sub-scores focusing on data analysis or financial calculations.

The Fluctuation of Pass Marks Across Hiring Cycles

A common point of frustration for applicants is encountering a fluid passing standard. An applicant might achieve a score that would easily secure an assessment centre invitation in November, yet find themselves rejected using the exact same score profile in February. This occurs because corporate cut-offs are frequently dynamic rather than fixed.

When a graduate scheme or summer-analyst program opens its application window, recruiters may initially set a baseline threshold at the 70th percentile. If the firm experiences an unexpected surge of highly qualified applicants, they will often raise the automated threshold to the 80th or 85th percentile mid-cycle to manage the pipeline. This makes early application advantageous, as the pool against which you are compared is smaller and the cut-off limits are often more relaxed.

Balancing Speed and Accuracy for a Maximize Score

The mathematical formula that determines your final percentile takes both speed and accuracy into account, though publishers weight them differently. On traditional non-adaptive assessments, incorrect answers do not deduct points; your score is based purely on the number of correct answers completed within the time limit. On these tests, a blind guess in the final seconds is mathematically logical.

On modern computer-adaptive tests, a string of early errors signals the algorithm to drop your difficulty level, severely damaging your peak score potential. For these platforms, rushing to finish every question at the expense of accuracy is a failing strategy. It is far better to answer 80 percent of the questions with absolute accuracy than to rush through 100 percent of the questions while guessing every third answer. Dedicated practice on platforms like Intervyo helps you learn to find your optimal equilibrium between operational speed and mathematical precision.

How Scores Influence the Broader Selection Process

Firms handle numerical data in two distinct ways: as a hard eliminator or as part of a holistic profile. In a hard-elimination model, if the software reports that you fell at the 69th percentile against the norm group and the corporate filter is set to 70, your application is automatically rejected without human review, regardless of your perfect university grades or excellent CV or resume.

Alternatively, some progressive employers utilize a balanced matrix scoring system. They combine your numerical reasoning percentile, verbal reasoning percentile, and situational judgement results into a single composite score. A stellar performance on your situational judgement assessment can sometimes offset a near-miss on the numerical subtest, provided your numerical score does not fall below a absolute minimum safety floor, which is typically around the 50th percentile.

How it works

How psychometric scoring engines calculate your result

The mechanics underlying psychometric scoring depend heavily on Item Response Theory (IRT) rather than simple classical test theory. Under an IRT framework, each question in the test bank is assigned a specific difficulty rating, a discrimination index, and a guessing factor based on historical data from thousands of previous test-takers. The scoring engine evaluates your capability profile continuously based on the exact characteristics of the items you get right or wrong, rather than treating every question as a single flat point.

When you complete the test, the software generates a standardized score, which is then translated into a percentile or a sten score ranging from 1 to 10. A sten score of 5 or 6 represents the exact average of the population, while a sten of 8, 9, or 10 corresponds to the top percentiles required by elite employers. The software automatically applies the employer selected norm group comparison; for instance, your raw outputs might be compared against a UK graduate norm group or a US finance analyst norm group, shifting your final score alignment.

Anti-cheating mechanisms are embedded directly into the scoring methodology. Modern psychometric platforms track micro-behaviors, including background tab switches, mouse movement patterns, and your specific response latency per question. If a candidate spends 4 seconds on a highly complex multi-stage calculation and inputs the correct answer, the system flags the item for potential compromise or external assistance.

Furthermore, many firms require a short, proctored verification test at the live assessment centre or superday. This validation test acts as a safeguard; the system compares your live, supervised performance against your initial unproctored online test. A significant statistical discrepancy between the two scores indicates a security breach, resulting in immediate disqualification from the hiring pipeline.

How to prepare

  1. 01

    Identify the test provider

    Check your official invitation email or corporate portal to determine if you are sitting an SHL, Saville, Cappfinity, or Korn Ferry assessment, as this dictates the scoring model.

  2. 02

    Establish a benchmark baseline

    Take an initial, timed practice test without preparation to discover your unassisted percentile rank and pinpoint your specific mathematical weak points.

  3. 03

    Drill specific numerical concepts

    Focus your revision on multi-currency conversions, percentage increases and decreases, margin calculations, and compound interest ratios, which form the core of corporate assessments.

  4. 04

    Master your specific calculator hardware

    Use the exact physical calculator model permitted during the test, ensuring you can execute multi-step bracket calculations rapidly without input errors.

A preparation timeline

  1. Two weeks before

    Take diagnostic tests to identify your provider format and isolate your mathematical weak points.

  2. One week before

    Conduct timed, focused subtest drills under strict conditions to align your pacing with your target percentile.

  3. Two days before

    Complete full-length practice assessments on Intervyo to solidify your speed-to-accuracy ratio and review error patterns.

  4. The day of the test

    Ensure a quiet testing environment, verify your internet stability, and prepare your physical scratch paper and calculator.

How candidates approached it

Anonymised accounts of how recent applicants prepared, what they experienced, and how it turned out.

Corporate Finance Track / UK Market / Accepted

Experience. I applied for a top-tier retail energy graduate scheme in London. The invitation was for an SHL interactive assessment, and I was terrified because my university background is in humanities, not maths. I spent ten days drilling chart interpretations and learning how to ignore distractor data in complex graphs. I did not finish the last two questions before the timer ran out, but I focused heavily on accuracy for the items I did complete. I passed the stage and was invited to the assessment centre within forty-eight hours, proving that you do not need a perfect raw score to clear the percentile hurdle.

Outcome. Accurate pacing matters more than guessing every item.

Investment Banking Summer Analyst / US Market / Rejected

Experience. I targeted three bulge-bracket investment banks in New York for summer analyst roles. I rushed through the online numerical assessments, assuming that finishing every question as fast as possible would yield a high score. I guessed on at least five questions where the data charts looked confusing just to maintain a fast pace. I received automated rejections from two of the firms the next morning. I realized too late that their computer-adaptive scoring models heavily penalize strings of incorrect answers by lowering your difficulty ceiling.

Outcome. Rushing and guessing on adaptive tests destroys your percentile ranking.

Questions to practise

A bank of adjacent questions candidates run into. Drill each one in the exact format firms use.

  • What is the difference between an adaptive and a non-adaptive numerical test?
  • How do I calculate percentage changes across multiple fiscal years quickly?
  • Do employers see how long I spend on each individual test question?
  • What mathematical topics are most frequently tested on corporate assessments?
  • Is a calculator permitted on all graduate numerical reasoning tests?
  • How can I convert currencies across compound time horizons under time pressure?
  • What happens if I experience a technical failure during my online assessment?
  • Why did I fail a numerical test despite answering most questions correctly?
  • How does an employer use a sten score to evaluate global applicants?
  • Can I retake a corporate numerical test if I miss the cut-off mark?
Read the full guidePsychometric Test Practice

This answer is general guidance for orientation, not a guarantee. Test formats, timings and employer cut-offs change, so verify the details on the provider or employer site before you apply. Last updated 1 July 2026.

Related questions

There is no universal passing score for an SHL test because the benchmark is determined by the employer. Most standard corporate schemes require you to fall within the 70th percentile or above compared to their graduate norm group, while investment banks often require an 80th percentile score.

More answers

More Numerical Reasoning questions

Practice, not theory

Reading the answer is not the same as being ready.

Intervyo turns this into practice: Psychometric Test Practice in the exact format firms use, scored by AI with feedback on what to fix. Start free, no card required.

Try Psychometric Test Practice

Numerical Reasoning

Know the answer. Now do the reps.

Every firm Pack turns the full process into practice: HireVue, psychometric and online assessments, live mock interviews, CV review and the process map, all in one place.

Browse all firms

Free to start, no card required