Collections of Interesting Data Tables

Genuine data tables found on the web which seem complex or otherwise noteworthy.

This research is often inactive due to professional commitments. It began on 19th May 2007 and was updated on 6th November 2007.

Feedback is welcome.

Numbers

How Authors Indicate Headers

Collections

Simulated Retrofitting

Investigating how tables might be adapted to become more accessible whilst keeping their meaning. Check the Method for Retrofitting Simulations.

astro:
6 genuine tables from the U.S. Naval Observatory’s Astronomical Applications Department Data Services.
clark2006:
19 genuine tables from Joe Clark’s Table examples for PDF/UA 1 (2006.01.27). (PDF/UA.)
finance:
2 genuine tables about money, with notes in the next section..
form:
1 genuine table with forms controls in it.
odi:
7 genuine tables from Office for Disability Issues (ODI) research, New Zealand.
thatcher:
2 genuine tables examples from USA government, sent to me by Jim Thatcher.
sports:
1 genuine table, with notes in the next section.
tides:
1 genuine Gorleston tide table, UK Broads Authority.

My Bookmarks

From browsing of the web, including deliberate searches for interesting tables. I biased the search towards the more popular websites for any given query.

When I say “Make a variant” or “E-mail them” I am inviting anyone to do it. Help spread the workload!

Astronomy

The Astronomical Almanac from the US Naval Observatory:

Computing

APIs Usage in VB6 “FileInfo” Project by Karl E. Peterson:
The Best Gaming Video Cards for the Money: May 2007 from Tom’s Hardware:
Harmonia GUI Framework by Andrew Fedoniouk:
Keryx (X)HTML Elements Best Practice Sheet by Lars Gunther:
Layout height attributes on body and html elements by Anne van Kesteren:
Optimize string handling in VB6 - Part II by Tuomas Salste:
Linkback by Wikipedia:
The QA Matrix by W3C QA:
date Parameters from the PHP Manual:

Education

A table of worldwide ages of consent, including US states by Avert:
School Teachers’ Review Body Statistical tables as annex to the 2005 written evidence from the DfES by teachernet:
Science and engineering departmental population at doctorate-granting institutions, by field: 1987-94 by the National Science Foundation:

Finance

FTSE 100 Listings from Money Extra with loads more UK stock tables:
Departmental financial statements from Disability Services Queensland:
Uses the same headers+id heirarchical row header patching technique as Stephen Ferg in the USA. E-mail them about any influence.
FTSE ACT 250 by Yahoo! Finance:
University of Wisconsin–Madison Facts: Budget:

Government

Bolton Museums - Contact Us:
Bureau of Labor Statistics, particularly these areas influenced by Stephen Ferg:
Minimal <th> and <td> are used. Minimal headers+id is added to patch up the HTML4 header search algorithm where needed.
National Statistics Online (UK)
It’s all PDF except for commentary and graphs?
TABLE Z-2 - 1910.1000 TABLE Z-2 from the US Department of Labor, Occupational Safety & Health Administration:
Try saying that three times quickly.
Expanded Homicide Data Table 2 from the Federal Bureau of Investigation:

Interactive

Dog selector test:
Faking a table for a form.

Timetables

Events - Lions Club of Fleet:
Timetables - Isle of Man Steam Packet Company Ferry Services:

Products

Fitting Bras, Correct Bra Size and Comparisons from Bigger Bras:

Sports

Detailed Review

I wrote a detailed review of sports tables which included:

The AFB reviewed some sports sites in early 2006, finding problems with data tables. Disabled people can be sports fans, if you hadn’t realised. Heard of the Paralympics?

ESPN

None of their tables use <th>. Their column headers use <td> with CSS to make it bold! But at least retrofitting <th> would be easy. E-mail them about it.

Their data tables are usually given a caption by placing a <td colspan> in the first row which spans all columns in that table. I call this an “embedded caption”. Is it so hard to style <caption>? Test it.

NHL Player Card for Daniel Alfredsson:
NHL Statistics Glossary:
NHL Boxscore:
  1. Untitled table showing scores per quarter:
    • No caption.
    • Row headers include some data.
    • Final column uses bold styling applied via CSS to indicate importance.
    • Top left cell is completely empty.
    • Seems indistinguishable from a layout table.
    • Make a variant.
  2. Three Stars:
    • 3 column layout table.
    • Multiple details per cell.
    • There are no column headers, just an embedded caption.
    • Probably won’t hurt if this was erroneously identified as a data table?
  3. Game Information:
    • 2 column layout table.
    • Multiple details per cell.
    • No column headers, just an embedded caption.
  4. Team Statistical Comparison:
    • Layout table.
    • Contains 6 tables in one cell.
    • Each of these tables is a diagram and not really a data table.
    • Need to see the colours and tell them apart to understand the data.
    • Make a variant where these are genuine data tables without depending on colour.
  5. 1st Period Summary:
    • Uses a <td> spanning the entire table width using align=center in sections where there is no data to report. Imply that is a headers would break this table.
    • Regular data table with one detail per cell.
    • Column headers are repeated.
    • Columns 3 and 4 start with individual headers but are replaced by a spanned header. “Smart colspan” wouldn’t recognise this because it would fail in other tables, IIRC.
  6. 2nd Period Summary, 3rd Period Summary and OT Summary are the same as 1st Period Summary.
  7. Player Summary is a layout table which contains 2 data tables which are the same:
    • 2 rows of column headers.
    • First column header is actually a caption for the table and shouldn’t be alongside the other two table headers. Make a variant.
    • First row headers span several columns.
    • Column headers span a single column.
    • Column headers use abbreviations which are not expanded. Make a variant. Can the text content of an <abbr> element in a column header be an alias for an abbr attribute value?
    • Row headers use <td>.
    • Data is very regular with one detail per cell, except player positions which are in the same column as player names. Make a variant.
  8. Goaltending Summary is a layout table which contains 2 data tables which are the same:
    • Column headers use some abbreviations which are not expanded. Make a variant.
    • 3 rows in total, 1 row of data. A small number of rows does not indicate a layout table.
    • Row header is marked up using <td>.
    • Very regular with one detail per cell.
  9. Shots on Goal:
    • Caption is embedded into the row of headers. Make a variant.
    • Column headers use abbreviations which are not expanded. Make a variant.
    • Row headers use <td>.
    • Very regular with one detail per cell.
MLB Stats 2007:
  1. Sortables:
    • Embedded caption.
    • Column headers are needed to disambiguate the link in each data cell.
    • Using <td colspan="2"> instead of <th colspan="2">. Make a variant. E-mail them about it.
  2. Two-column layout table:
    • First cell in each column uses same markup as genuine table headers elsewhere.
    • The key difference is this table contains other tables. That means it cannot be a data table.
PGA Tour Statistics:
Tiger Woods - Player Card:
  1. PGA Season Overview - 2007:
    • Row headers use plain <td>.
    • 4 rows in total and only 2 are for data. A small number of rows does not indicate a layout table.
    • Very regular data with one detail per cell.
  2. PGA Tour Stat Ranks - 2007:
    • First column header spans two columns even though they contain different details. Make a variant which gives the second column a “value” header.
    • Row headers use plain <td>.
    • Very regular data with one detail per cell.
  3. 2007 Tournaments:
    • Most useful row headers are probably the event names, in column 2:
      • Make a variant where these use <th>.
      • Make a variant where column 2 is swapped with column 1.
    • Regular data but each cell in column 2 and column 4 contains multiple details.
    • Table ends with a full-width row which contains an endnote.
Indy Racing League Race Schedule:
NHRA Results:
Eurosport
Overall Team Standings: Stage 20:
Soccer
League Table - Premier League Soccer (UK):

Elsewhere

Collections I’ve seen but not worked on:

If you send in a collection I shall add it to this list but I might not work on it.

About this Research

I am Ben ‘Cerbera’ Millard. My aims in doing this are:

Feedback

Corrections (no matter how small), better translations of the non-English tables to English, links to other collections and so on are welcome. In order of preference:

  1. Participate in the Data Table Collections (Research) thread of W3C’s public-html mailing list. (Recommended.)
  2. Add to the Accessify Forum topic. (This keeps the work public.)
  3. Send to my e-mail account, cerbera@projectcerbera.com.

Please include both “Table” and “Collection” in any e-mail subject lines to help me track feedback. They can be in any order, with words between them. Plurals are fine.

Method for Retrofitting Simulations

For each table found on the web:

  1. If it is part of an existing collection:
    1. Create a subdirectory for this table.
  2. Otherwise:
    1. Create a new directory for this new collection.
    2. Create a subdirectory for this table.
  3. Create an original.html file with the table markup from the original page.
  4. Create some variants of it, usually these:
    minimal.html:
    Strip the original to the simplest markup without changing cell arrangements. Add border=1 to make structure visible.
    scope.html:
    Add scope attributes to the minimal.html example, with grouping elements as necessary.
    scope-abbr.html:
    Add abbr attributes to the scope.html example where appropriate.
    Special variants:
    • Simpler header arrangements.
    • Adjescant empty cells as spanned empty cells.
    • Translate to English.
    • Add <abbr title>.
    • Non-conformant markup where conformant markup is inadequate.
    • Etc.
  5. Get a feel for conformance and sanity using:
  6. Upload to the web (duh).
  7. Update this page if a new collection was created.

Future?

No more original.html files; they are too big a bottleneck. Dumping links with a summary is more useful for categorising the use cases. It also helps other Participants find things to do.