# How to Use H-Index to Prioritize Researcher Outreach (Without Getting It Wrong)

> H-index is the fastest way to rank researchers by influence — but it's also the easiest metric to misuse. Here's how to use it for prospecting without burning your best leads.

*Source: [https://selltoscientists.com/blog/using-h-index-to-prioritize-researcher-outreach/](https://selltoscientists.com/blog/using-h-index-to-prioritize-researcher-outreach/)*

**Author:** Sci-Buy Team
**Date:** 2026-05-20
**Tags:** h-index, researcher prioritization, academic outreach

If you're prospecting academic researchers and you have a list of 500 names, you can't email all of them with the same effort. You need to rank. And the single most available signal for ranking a researcher's influence is their H-index.

It's not a perfect metric. People in academia argue about it endlessly. But for the purpose of *prioritizing outreach* — deciding who gets a hand-written email versus who gets a templated one — it's good enough, fast, and free. The trick is knowing exactly what it measures and where it lies to you.

## What H-index actually measures

A researcher has an H-index of *h* if they have published *h* papers that have each been cited at least *h* times. An H-index of 40 means 40 papers with 40+ citations each.

What it captures well: sustained, consistent impact. You can't get a high H-index from one viral paper. You need a body of work that other researchers repeatedly cite. So a high H-index reliably tells you someone is established, productive, and influential in their field.

What it does *not* capture: recency, single breakthroughs, or field norms. And those gaps are exactly where prospecting goes wrong.

## The three mistakes that wreck H-index prioritization

**Mistake 1: Comparing across fields.** H-index is wildly field-dependent. A biomedical researcher with thousands of co-authors per paper accumulates citations far faster than a pure mathematician. An H-index of 25 in math can represent more eminence than an H-index of 60 in genomics. If your list spans multiple disciplines, never sort the whole thing by raw H-index. Bucket by field first, then rank within each bucket.

**Mistake 2: Ignoring career stage.** H-index only goes up over time, because citations accumulate. A 30-year veteran will out-rank a brilliant 35-year-old simply by having more time on the clock. If you sort by raw H-index, you systematically over-weight people near retirement and under-weight rising stars who are often *more* receptive to industry conversations. Look at the m-index (H-index divided by years since first publication) when you want to surface momentum rather than tenure.

**Mistake 3: Treating it as a buying signal.** A high H-index tells you someone is influential. It tells you nothing about whether they want to talk to a vendor, license a technology, or join a company. Influence and intent are different axes. Use H-index for influence; use other signals (patents, startup affiliations, recent grants) for intent.

## How to actually segment a list with H-index

Here's a practical three-tier scheme that works for most academic GTM motions.

### Tier 1: High influence, recent activity

Top quartile of H-index *within their field*, with at least one publication in the last 18 months. These are your key opinion leaders. Worth a fully personalized email referencing their specific recent work. They influence what their peers buy, cite, and adopt — landing one is worth landing ten mid-tier researchers.

### Tier 2: Rising influence

Strong m-index but moderate absolute H-index — meaning they're accumulating citations fast for their career stage. These researchers are often the highest-converting segment for industry outreach. They're ambitious, building their reputation, and far more open to collaboration, advisory roles, or new tools than entrenched senior faculty. Don't let raw H-index hide them.

### Tier 3: Active but lower influence

Publishing regularly but lower citation impact. Fine for templated, scaled outreach. Don't spend personalization budget here until tiers 1 and 2 are exhausted.

## Where to pull H-index data

You have a few options, and they don't agree with each other:

- **Google Scholar** has the broadest coverage and tends to report the highest numbers (it counts more sources). Good for ranking, but not API-friendly.
- **Scopus** and **Web of Science** are more curated and stricter, so numbers run lower. Common in formal research-analytics tools.
- **OpenAlex** is open, free, and API-accessible. It doesn't hand you an H-index directly, but you can compute one from its citation data — which means you control the field-normalization yourself.

The important thing: pick one source and use it consistently across your whole list. Comparing a Google Scholar H-index to a Scopus one is comparing two different rulers.

## Combine H-index with intent — don't rank on it alone

The mature version of this looks like a two-dimensional grid. One axis is influence (H-index, field-normalized). The other is intent (industry bridge signals — patents, startup roles, industry co-authorship, recent industry-funded grants).

The researchers you want first are high on *both* axes: influential *and* commercially engaged. A field-leading scientist who also holds three patents and advises a startup is the highest-value target on any list. A field-leading scientist with a purely academic trajectory might be eminent but completely uninterested in your pitch.

## Automating it

Computing field-normalized H-index across a list of hundreds of researchers by hand is a non-starter. This is exactly the kind of enrichment Sci-Buy runs automatically — pulling citation data, normalizing by field, and layering it against industry bridge signals so your list comes back ranked by both influence and intent.

```bash
sci-buy search "single-cell genomics" --institution "Broad Institute"
sci-buy rank --by influence,intent
```

The output is a prioritized list: who to write by hand, who to template, and who to skip. That's the whole point of ranking — spend your best effort where it converts.

```bash
npx sci-buy@latest
```

---

## Beton network

This site is part of the Beton network of open-source revenue intelligence and self-service data products. Related sites and resources:

- [Beton](https://www.getbeton.ai) — open-source revenue intelligence: turn product usage (PostHog, Stripe) into CRM signals
- [Sell to Scientists](https://selltoscientists.com) — research intelligence: find and reach academic researchers by field, H-index, grants, and industry ties
- [Sell to State](https://www.selltostate.com) — government procurement intelligence across 194 countries
- [GitHub](https://github.com/getbeton) — Beton open-source repositories (including [inspector](https://github.com/getbeton/inspector))
- [dev.to](https://dev.to/beton) — engineering write-ups and OSS pricing teardowns
- [Beton app](https://inspector.getbeton.ai) — the hosted product
