Free GEDCOM Analyzer: Inspect, Validate and Visualise Your Family Tree Online
Most genealogists hit the same wall: a GEDCOM file with thousands of individuals, dozens of contributors, and no easy way to see what is wrong, what is missing, or what to research next. Generic tree viewers show you the data — they don't tell you where the holes are, where the duplicates hide, or which decades of your family history are essentially undocumented.
GEDminer is a free GEDCOM analyzer that loads your .ged file entirely in your browser and produces a full diagnostic of the tree in seconds: invalid dates, duplicate individuals, unsourced facts, isolated branches, location gaps, migration patterns, surname clustering, occupation breakdowns, and a weighted data quality score you can track over time. Nothing is uploaded to a server, no signup is required, and the original file never leaves your device.
Use it to audit a tree you've inherited, sanity-check your own work before sharing with cousins, validate a tree you're about to merge, or simply visualise where your ancestors actually came from. Below is exactly what the analyzer detects, mapped to the problems most researchers care about, followed by an in-depth look at how each subsystem works and a long-tail FAQ.
How GEDminer solves it
You don\'t know how complete or accurate your tree is.
A weighted Tree Health Score (Completeness 40%, Sourcing 30%, Consistency 30%) gives you a single number to track over time, plus a community percentile.
Tree Health Score →Dozens of duplicate ancestors entered by different contributors.
Phonetic + fuzzy matching surfaces likely duplicate individuals with confidence scores so you can merge them safely in your main software.
Duplicate Finder →Birth dates after death dates, ages over 120, and other impossible facts.
The integrity scanner flags every logical inconsistency in the tree with the exact GEDCOM record involved.
Error & Consistency Checks →You can\'t see where your family lived or moved over time.
An interactive migration map plots first international moves, peak migration decades, and concentration centroids by surname or branch.
Migration Analysis →You have no idea which census or BMD records are missing.
The Census Toolbox and Vital Sharpener list the exact gaps per ancestor, per country, per decade — the UK, Ireland, USA and Canada are covered out of the box.
Census & BMD Gap Tools →Branches that look isolated from the main tree.
Connected-component analysis surfaces every floating sub-tree and flags surname/region overlaps with the main lineage so you can reconnect them.
Hidden Cousin Connector →What a GEDCOM analyzer actually does
A GEDCOM analyzer reads the standardised text-based genealogy format that most family-history software exports (.ged or .gedcom) and turns it into structured data you can query and visualise. A good analyzer goes well beyond a tree viewer — it cross-references every individual, family, fact, source and place in the file to surface patterns and problems that are impossible to spot manually in a tree of any size.
GEDminer parses your file in three passes: a fast structural pass to read the GEDCOM grammar (level numbers, tags, cross-references — supporting both GEDCOM 5.5 / 5.5.1 and the newer GEDCOM 7 spec), a normalisation pass that tries multiple character encodings (UTF-8, Windows-1252, Mac Roman, Latin-1, ANSEL) so old files with mangled accents render correctly, and a semantic pass that builds an in-memory model of individuals, families, events, sources and citations.
From that model, every report — duplicates, errors, location standardisation, migration, census gaps, occupations, surnames, lifespans — is computed on demand in a background worker so the UI stays responsive even on trees with hundreds of thousands of people.
Why run a free GEDCOM analyzer before sharing your tree
Sharing a family tree with a cousin, uploading it to a collaborative site, or publishing a one-name study all expose your data to scrutiny. Errors that were invisible while you were the only reader become very visible the moment anyone else looks. Running the tree through an analyzer first catches the things you can't unsee:
- Impossible vital dates (deaths before births, marriages at age 4, ages over 120)
- Duplicate individuals that arrived from a merged GEDCOM and were never reconciled
- Encoding corruption that turns "Müller" into "Müller" and "O'Brien" into garbled bytes
- Unsourced facts that look authoritative but have no underlying citation
- Disconnected branches that you forgot were even in the file
- Place inconsistencies that defeat downstream search and mapping
Spending half an hour with the analyzer before publishing or sharing typically eliminates 80% of the issues a careful reader would otherwise spot — and it lifts your data quality score in the process.
The Tree Health Score and what it measures
Every analysis ends with a single weighted score from 0 to 100, broken into three components:
- Completeness (40%) — what fraction of expected facts (birth, death, parents, marriage, place) are actually present.
- Sourcing (30%) — total source citations divided by total facts, expressed as a percentage. A well-sourced tree typically lands above 60%.
- Consistency (30%) — how few logical errors (impossible dates, broken links, encoding issues) the integrity scanner finds, normalised by tree size.
The score is comparable across exports, which means you can re-export after a cleanup pass and immediately see whether your work moved the needle. There's also an optional community percentile so you can see where your tree sits relative to others of similar size.
In-browser parsing: privacy and performance
Genealogy data is sensitive. It contains living relatives, addresses, and sometimes adoption or paternity events that family members would not want exposed. GEDminer's analyzer is engineered around this reality.
The .ged file is opened with the browser's File API, parsed in a Web Worker, held in memory while you use the app, and discarded when the tab closes. If you create an optional account and choose to save an analysis, GEDminer stores a compressed version of the parsed analysis data so you can reopen it later. This may include names, dates, places and family relationships extracted from your GEDCOM. Your original .ged file is still never uploaded or stored.
Performance is just as important. The streaming parser uses incremental \indexOf\ scans rather than loading the whole file into a string at once, so files up to 500MB and trees over 200,000 individuals stay responsive. The UI virtualises long lists, debounces search, and offloads heavy work (duplicate scoring, geocoding, kinship calculation) to background workers.
How GEDminer compares to desktop genealogy software
Desktop genealogy programs are designed for data entry and tree editing. GEDminer is designed for data analysis and quality control. They complement each other rather than compete.
A typical workflow: do your day-to-day editing in your existing program, then once a month export a GEDCOM and run it through GEDminer. The analyzer gives you the prioritised cleanup list (duplicates, impossible dates, missing sources, incomplete families); you make the fixes back in your editor and re-export.
The analyzer is intentionally read-only — it never writes back to your file. This keeps the workflow safe, and it keeps GEDminer compatible with every desktop and online genealogy program that can export standard GEDCOM 5.5 / 5.5.1 / 7.
Step-by-step guides
How to Analyse a GEDCOM File Online
A step-by-step walkthrough for uploading your GEDCOM file and getting instant insights into your family tree - demographics, errors, migration patterns, and research priorities.
What Is a GEDCOM File? The Complete Guide
Everything you need to know about the GEDCOM file format - its history, structure, how to create one, and what you can do with it using modern analysis tools.
How to Find and Fix GEDCOM Errors
A practical guide to detecting impossible dates, missing records, duplicate entries, and other data quality issues in your GEDCOM family tree file.
Finding and Merging Duplicate Individuals
Find potential duplicate individuals in your tree using smart matching, compare their records side-by-side, and learn best practices for merging them.
Understanding Your Tree Health and Data Quality Score
Learn how GEDminer evaluates your tree\'s data quality, what the health score means, and practical steps to improve your tree\'s completeness and accuracy.
Understanding Your Family Tree Dashboard
Master the Overview dashboard to quickly understand your tree\'s scope, completeness, and key patterns with demographic charts and statistics.
How to Merge Two GEDCOM Files Without Losing Data
Combining two family trees is one of the most error-prone tasks in genealogy. This guide walks through a safe merge workflow using GEDminer to spot duplicates, conflicting dates, and overlapping branches before you commit.
How to Recover a Corrupted or Broken GEDCOM File
A GEDCOM that won\u2019t load can feel catastrophic. Most "corrupted" files are actually salvageable with a few targeted fixes. Here is the diagnostic workflow.
Frequently asked questions
Is the GEDminer GEDCOM analyzer really free?
Yes. Every analysis tool runs in your browser at no cost, with no upload limit beyond the 500MB GEDCOM file size cap. An optional free account lets you save analyses for later, but it is not required to use any feature.
Where is my GEDCOM file uploaded?
Nowhere. Parsing happens entirely on your device using the browser\'s file APIs. If you create an account and save a tree, the parsed analysis data is stored securely in your account so you can return to it later. This may include names, dates, places and relationships extracted from the GEDCOM. The original .ged file is never uploaded or stored.
How large a GEDCOM file can the analyzer handle?
GEDminer streams and parses GEDCOM files up to 500MB and has been tested with trees containing more than 200,000 individuals. Performance depends on your device, but the UI stays responsive thanks to background workers and virtualised lists.
What format does the analyzer expect?
Standard GEDCOM 5.5 / 5.5.1 / 7 files (.ged or .gedcom extension). Almost every desktop and online genealogy program can export one of these. Proprietary backup files (.ftw, .ftm, .rmgc) need to be exported as GEDCOM first from inside their original program.
Can I export the analyzer\'s findings?
Yes. Most reports — duplicates, errors, missing facts, migration events, surname distribution, occupations — can be exported as CSV or XLSX so you can paste them into a research log, share with relatives, or use them in a spreadsheet workflow.
Does the analyzer work offline?
After the first load, the core analyzer runs entirely client-side, so a brief drop in connectivity will not interrupt your analysis. The interactive map tiles need a connection to load, but every other report works offline.
Can I analyse a GEDCOM file from MyHeritage, Family Historian, RootsMagic or Gramps?
Yes. Any program that exports a standard GEDCOM 5.5 / 5.5.1 / 7 file is supported. Some programs add custom (vendor-specific) tags; those are preserved during parsing but ignored by the standard analyses.
Will the analyzer modify my GEDCOM file?
No. GEDminer is strictly read-only. It tells you what is wrong and where; you make the fixes in whichever genealogy program you normally use, then re-export and re-analyse to confirm the score has improved.
How accurate is the duplicate detection?
Duplicate matching combines Soundex phonetic codes with Levenshtein edit-distance scoring against names, vital dates and parents. Pairs are only flagged above a 0.55 confidence threshold, and every match is shown with its score so you can decide whether to merge.
Does the analyzer support languages other than English?
Yes. The parser is encoding-aware (UTF-8, Windows-1252, Mac Roman, Latin-1) so non-English names render correctly. Place-name standardisation works on any country, and the migration map covers the entire world. The user interface is currently English-only.
Can I compare two GEDCOM exports to see what changed?
The Tree Health Score is comparable across exports, so re-running the analyzer after a cleanup pass will show whether your changes improved completeness, sourcing or consistency. Detailed per-record diff is on the roadmap.
Related tools
Find and Fix Family Tree Errors Automatically
Detect impossible dates, duplicate ancestors, missing parents and broken relationships in your family tree in seconds. Free, browser-based GEDCOM error checker.
Clean Up Your Family Tree: Remove Duplicates, Fix Dates and Standardise Places
Tidy up an inherited or messy family tree: merge duplicates, standardise place names, fix encoding issues and surface missing facts. Free, browser-based.
Genealogy Data Analysis: Statistics, Maps and Patterns from Your GEDCOM File
Turn your family tree into insight: birth-year charts, migration maps, surname distributions, occupation breakdowns and lifespan trends. Free GEDCOM analytics.
Find Missing Ancestors in Your Family Tree
Uncover missing ancestors, hidden cousins, and unrecorded generations in your family tree. Free analyzer that pinpoints exactly where your research has gaps.
Ready to analyse your tree?
Drop your .ged file into GEDminer and get a full diagnostic in seconds. Your file never leaves your browser.
Upload GEDCOM file