Visualization publications dataset

We are making available a dataset that contains information on IEEE Visualization (IEEE VIS) publications from 1990-2015. The dataset includes a variety of information about each paper including title, authors, DOI, etc., as well as a list of the citations to other previous VIS papers. To download the dataset open the Google spreadsheet through the link below and choose File->Download as. If you spot errors in the dataset feel free to leave a comment in the respective cells. We will try to maintain and fix the spreadsheet.

Current version: 4.00


Existing Visualizations about the Dataset

About the Dataset

The dataset is stored as a google spreadsheet. The data contains the following columns:
  • A: Short title of the conference the paper appeared in. One of {SciVis, InfoVis, VAST, VIS}. See notes on conference names below.
  • B: The year the paper appeared
  • C: The title of the paper
  • D: If we found it, a DOI to the paper
  • E: A link to the paper in the IEEE digital library - based on the DOI
  • F: The first page of the paper in the printed proceedings (this information has not been thoroughly checked for correctness)
  • G: The last page of the paper in the printed proceedings (this information has not been thoroughly checked for correctness)
  • H: The IEEE XPLORE article number - if we could find it. This should be a unique identifier
  • I: This column contains an X  if the paper cited is either a capstone, keynote, panel, or poster. Basically - if the column contains an X the paper isn't in a sense a typical scientific conference paper.
  • J: Paper type: one of C (conference paper), T (TVCG journal paper), M (miscellaneous). If you would like to filter for scientific research articles, then take all Cs and Ts (or alternatively all papers without an X in column I)
  • K: The abstract of the paper
  • L: Author names. The authors are written as Lastname, Initial and co-authors are separated by a semicolon. Authors are ordered by how they appear on the paper. This information is likely not 100% correct.
  • M: affiliation of the first author. This information has not been cleaned up.
  • N: author ids according to the IEEE Xplore library - if we could find them
  • O: sometimes we could not find an article in the IEEE Xplore library - but it was present in the printed proceedings. In this case we added a manually chosen article number following the IEEEXplore style in order to have a unique identifier for each paper. This column contains an x for those papers where we manually chose this number. You will likely not find a paper under this number in the library.
  • P: A copy of column L but with author names cleaned up and duplicates removed. We attempted to unify different listings/writings of the same author's name to one unique string.
  • Q: A list of references to other IEEE VIS papers using IEEEXplore article numbers. This list does not include citations to papers external to this conference series.

Methodology

The data was collected through a combined manual and automatic approach. Information was mainly extracted from the pdf proceedings and double-checked with information from the IEEEXplore library. Where discrepancies occurred we manually checked the printed or electronic proceedings and added information by hand. We deduped author names (unified different listings of one author's name) to provide cleaner author data using the Jigsaw system.

Notes on IEEE VIS, its Child Conferences and Naming

The IEEE Conference on Visualization started as a conference in 1990 under the name IEEE Visualization (Vis). It quickly grew and in 1995 the IEEE Symposium on Information Visualization (InfoVis) was held. The symposium was later renamed into the IEEE Conference on Information Visualization but kept the acronym InfoVis (not to be confused with the other IEEE Conference on Information Visualization that uses the acronym IV). In 2006 the Symposium  on Visual Analytics Science and Technology (VAST) joined and was later renamed to the VAST conference. The original IEEE Visualization track ran under used the acronym VIS for several years but was later renamed to the IEEE Conference on Scientific Visualization using the acronym SciVis in 2013. For a number of years the three conferences ran together under the umbrella name VisWeek in what you can more or less think of as parallel tracks of a big joint conference. In 2013, however VIS was chosen as the joint acronym for all three conferences and the name VisWeek was dropped.

The dataset is further a bit complicated in that VAST has since 2012 had both conference as well as TVCG journal papers. The top ranked papers are considered journal papers and the other type of papers are considered conference track papers. We chose to mark the conference only papers in the dataset in Column J so that you can filter them if needed. SciVis also experimented with this concept in 2015.

The image below is a graphical explanation of the naming changes for the conference series between 1990 and 2015:

IEEEVis Naming History Chart


Citation

If you use this dataset we would appreciate a citation. 
Below is a suggestion for how to cite it:



Visualization Publication Dataset, http://vispubdata.org/

@Misc{Isenberg:2015:VPD,
  author = {Petra Isenberg and Florian Heimerl and Steffen Koch and Tobias Isenberg and Panpan Xu and Chad Stolper and Michael Sedlmair and Jian Chen and Torsten M\"{o}ller and John Stasko},
  title = {Visualization Publication Dataset},
  year = {2015},
  howpublished = {Dataset: \href{http://vispubdata.org/}{http://vispubdata.org/}\,},
  url = {http://vispubdata.org/},
  note = {Published Jun.\ 2015}
}



A publication about this dataset is in production. Please check back later to update your citation.

Acknowledgements

The dataset was prepared primarily by the following people:

With significant help from:

  • Torsten Möller (University of Vienna)
  • Michael Sedlmair (University of Vienna)
  • Panpan Xu (Hong Kong University of Science and Technology)
  • Chad Stolper (Georgia Tech)
  • Jian Chen (UMBC)

And with a little bit of help from:

  • Charles Perin (Inria)
  • Nadia Boukhelifa (Inria)
  • Jean-Daniel Fekete (Inria)
  • Wesley Willett (Inria)
  • Fred Vernier (Université Paris Sud)
  • Pascal Goffin (Inria)
  • Heidi Lam (Google)