PaisaWatch

Why this data is incomplete — and why that matters

Everything on PaisaWatch is real and links to its source. But it is not complete — and the gaps are not an accident of this project. They are the visible edge of how hard it is to see what India's elected representatives declare, earn, and do with public money. This page explains, plainly, what is missing, why, and how to read what remains.

The honest coverage, right now

Lok Sabha (lower house)
485 / 543
The rest have no published affidavit analysis.
Rajya Sabha (upper house)
0 / 245
No open, machine-readable per-member record exists.
MLAs (state assemblies)
3,725 / 4,120
Across 28 states + 3 UTs; rest are unanalysed.

The missing people are not hidden by choice. They are precisely the ones for whom no verifiable public record could be found — so we list a coverage count rather than invent their numbers.

Why the data is incomplete

  • There is no single source of truth. Spending, affidavits, attendance and rosters live on different portals, run by different bodies, in different formats, for different years. Assembling one honest picture means stitching a dozen incompatible systems together.
  • Coverage depends on a non-profit doing the state's job.The asset, liability and case figures come from a civic group manually reading scanned nomination affidavits. Where an affidavit is missing or illegible, that person simply isn't analysed — which is why the count is 485 of 543, not all 543.
  • The upper house is a near-blackout.Rajya Sabha members file affidavits too, but no open, per-member, machine-readable version exists. The official portal's data sits behind an access-gated interface; the government open-data roster is from 2016; a leading tracker renders its list only in JavaScript; and the crowd-sourced encyclopaedia was inaccurate — its table mislabelled sitting members' parties. None met a 100%-accuracy bar, so the upper house stays empty rather than wrong.
  • Spending is aggregate, not itemised.India publishes local-area development spending by state and sector — not what each individual representative spent, line by line. State MLA funds aren't published as open data at all.
  • What exists is self-declared.Assets, liabilities and pending cases come from the candidate's own affidavit. A “pending case” is a charge, not a conviction. Declared assets are what was disclosed on a date — not independently audited.

Why this counts as opacity

Data that exists on paper but can't be read at scale isn't truly public. A PDF behind a login, a table only a browser can render, a roster frozen in 2016 — each is technically “available” and practically useless to a voter, a journalist or a researcher. Fragmentation is itself a form of secrecy: when the truth is scattered across incompatible systems, only those with the time, skill and money to assemble it can see it. Transparency that requires a data engineer is not transparency for everyone. And a system that trusts what officials declare about themselves, with little public means to verify, shifts the burden of proof onto the citizen.

Why opaqueness is bad for democracy

  • A vote without information is a guess.Elections only do their job when voters can see who they're choosing — declared wealth, pending cases, attendance, how public money is used. Remove the information and consent of the governed becomes consent in the dark.
  • Sunlight deters misuse. The expectation of being seen changes behaviour. Where records are hard to find, the incentive to cut corners grows — and the cost of hiding is paid by the people who can least afford to dig.
  • Opacity protects power and burdens the public. When accountability data is locked away, the advantage goes to whoever wants to avoid scrutiny.
  • Trust erodes in the dark.When citizens can't verify, rumour fills the gap — and a democracy that runs on rumour instead of records is a fragile one.

How to read this data responsibly

“Pending case” is not guilt. It is a declared, unresolved legal matter — context, not a verdict. Open the source before drawing a conclusion.

“Declared assets” is not audited wealth. It is self-reported, on a date, for one nomination.

“Analysed” is not “all.” 485 of 543 means 58 people have no published affidavit — not that they are clean, and not that they are guilty.

Big numbers aren't scandals by themselves. A large state spends large sums. Size is not wrongdoing — look at what the money bought and whether it was tendered.

Performance metrics are partial. Attendance and questions measure presence and activity, not the quality or impact of someone's work.

Cross-check before you conclude. One figure is a starting point. Follow the source link, compare across years, and read past the headline number.

Absence of data is data too. When a seat can't be filled with verifiable facts, that gap is itself worth noticing — and worth asking why.

What better would look like

A single, official, machine-readable, regularly-updated public register — every elected representative, both houses and every assembly, with declared assets, pending cases, attendance, and the itemised use of public funds — downloadable by anyone, free, forever. Until that exists, tools like this one assemble what is public and flag, honestly, what is not. The gaps on this page are the work that's left for our institutions to do.