Visualization publications dataset

We are making available a dataset that contains information on IEEE Visualization (IEEE VIS) publications from 1990-2015. The dataset includes a variety of information about each paper including title, authors, DOI, etc., as well as a list of the citations to other previous VIS papers. To download the dataset open the Google spreadsheet through the link below and choose File->Download as. If you spot errors in the dataset feel free to leave a comment in the respective cells. We will try to maintain and fix the spreadsheet.

Current version: 6.00

Existing Visualizations about the Dataset

About the Dataset

The dataset is stored as a google spreadsheet. The data contains the following columns:
  • A: Short title of the conference the paper appeared in. One of {SciVis, InfoVis, VAST, Vis}. See notes on conference names below.
  • B: The year the paper appeared
  • C: The title of the paper
  • D: If we found it, a DOI to the paper. Where a paper did not include a valid DOI (2 cases at this point), a syntactically valid but fake DOI was entered that starts with 10.0000. This column can, thus, be used as a unique identifier for each paper.
  • E: A link to the paper in the IEEE digital library - based on the DOI
  • F: The first page of the paper in the printed proceedings (this information has not been thoroughly checked for correctness)
  • G: The last page of the paper in the printed proceedings (this information has not been thoroughly checked for correctness)
  • H: This column contains an X  if the paper cited is either a capstone, keynote, panel, or poster. Basically - if the column contains an X the paper isn't in a sense a typical scientific conference paper. This column might be removed soon as it is redundant encoding with column I - all X here correspond to the "M"s in column I
  • I: Paper type: one of C (conference paper), J (journal paper), M (miscellaneous). If you would like to filter for scientific research articles, then take all Cs and Ts (or alternatively all papers without an X in column I)
  • J: The abstract of the paper
  • K: Author names. The authors are written as Lastname, Initial and co-authors are separated by a semicolon. Authors are ordered by how they appear on the paper. This information is likely not 100% correct.
  • L: affiliation of the first author. This information has not been cleaned up.
  • M: author ids according to the IEEE Xplore library - if we could find them (note, the IEEE DL does not seem to consistently provide this information anymore)
  • N: A copy of column L but with author names cleaned up and duplicates removed. We attempted to unify different listings/writings of the same author's name to one unique string.
  • O: A list of references to other IEEE VIS papers using IEEEXplore article numbers. This list does not include citations to papers external to this conference series.
  • P: A list of author keywords freely chosen by the authors - extracted as found on the paper pdf
  • Q: Depcreated - this column will be removed from the dataset in one of the next versions. The DOI can now be used as the unique identifier.
    • The IEEE XPLORE article number - if we could find it. This should be a unique identifier
  • R: Deprecated - this column will be removed from the dataset in one of the next versions
    •  sometimes we could not find an article in the IEEE Xplore library - but it was present in the printed proceedings. In this case we added a manually chosen article number following the IEEEXplore style in order to have a unique identifier for each paper. This column contains an x for those papers where we manually chose this number. You will likely not find a paper under this number in the library.
  • S: Deprecated - this column will be removed from the dataset in one of the next versions
    • references to other papers in the dataset based on the IEEEXplore article number. Use column O instead.


The data was collected through a combined manual and automatic approach. Information was mainly extracted from the pdf proceedings and double-checked with information from the IEEEXplore library. Where discrepancies occurred we manually checked the printed or electronic proceedings and added information by hand. We deduped author names (unified different listings of one author's name) to provide cleaner author data using the Jigsaw system.

Notes on IEEE VIS, its Child Conferences and Naming

The IEEE Conference on Visualization started as a conference in 1990 under the name IEEE Visualization (Vis). It quickly grew and in 1995 the IEEE Symposium on Information Visualization (InfoVis) was held. The symposium was later renamed into the IEEE Conference on Information Visualization but kept the acronym InfoVis (not to be confused with the other IEEE Conference on Information Visualization that uses the acronym IV). In 2006 the Symposium  on Visual Analytics Science and Technology (VAST) joined and was later renamed to the VAST conference. The original IEEE Visualization track ran under used the acronym VIS for several years but was later renamed to the IEEE Conference on Scientific Visualization using the acronym SciVis in 2013. For a number of years the three conferences ran together under the umbrella name VisWeek in what you can more or less think of as parallel tracks of a big joint conference. In 2013, however VIS was chosen as the joint acronym for all three conferences and the name VisWeek was dropped.

The dataset is further a bit complicated in that VAST has since 2012 had both conference as well as TVCG journal papers. The top ranked papers are considered journal papers and the other type of papers are considered conference track papers. We chose to mark the conference only papers in the dataset in Column J so that you can filter them if needed. SciVis also experimented with this concept in 2015.

The image below is a graphical explanation of the naming changes for the conference series between 1990 and 2015:

IEEEVis Naming History Chart

NOTE: The image above is explicitly placed into the public domain. If you use it in your own work we would still appreciate a citation to our journal article.


If you use this dataset or the image above we would appreciate a citation. A Metadata Collection about IEEE Visualization (VIS) Publications
Petra Isenberg, Florian Heimerl, Steffen Koch, Tobias Isenberg, Panpan Xu, Charles Stolper, Michael Sedlmair, Jian Chen, Torsten Möller, John Stasko
IEEE Transactions on Visualization and Computer Graphics, Institute of Electrical and Electronics Engineers, 2017, 23. To appear.
(pdf)  (doi)

@article{isenberg:2017:VPD, TITLE = {{ A Metadata Collection about IEEE Visualization (VIS) Publications}}, AUTHOR = {Isenberg, Petra and Heimerl, Florian and Koch, Steffen and Isenberg, Tobias and Xu, Panpan and Stolper,
            Charles D and Sedlmair, Michael M and Chen, Jian and M{\"o}ller, Torsten and Stasko, John}, URL = {}, NOTE = {To appear}, JOURNAL = {{IEEE Transactions on Visualization and Computer Graphics}}, PUBLISHER = {{Institute of Electrical and Electronics Engineers}}, VOLUME = {23}, YEAR = {2017}, KEYWORDS = {Visualization ; publication data ; citation data}, PDF = {},
  DOI = {},


The dataset was prepared primarily by the following people:

With significant help from:

  • Torsten Möller (University of Vienna)
  • Michael Sedlmair (University of Vienna)
  • Panpan Xu (Hong Kong University of Science and Technology)
  • Chad Stolper (Georgia Tech)
  • Jian Chen (UMBC)

And with a little bit of help from:

  • Charles Perin (Inria)
  • Nadia Boukhelifa (Inria)
  • Jean-Daniel Fekete (Inria)
  • Wesley Willett (Inria)
  • Fred Vernier (Université Paris Sud)
  • Pascal Goffin (Inria)
  • Heidi Lam (Google)