# Re: Vispedia: Interactive Visual Exploration of Wikipedia Data via Search-Based Integration

From: Kei Cheung <kei.cheung@yale.edu>
Date: Mon, 10 Nov 2008 13:20:48 -0500
To: "Luciano, Joanne S." <jluciano@mitre.org>
Cc: "Maged N.K. Boulos" <mnkboulos@gmail.com>, public-semweb-lifesci@w3.org
Message-id: <49187B80.4060209@yale.edu>
```
I'm glad you picked that up, Joanne. You're right that the scatter plot
was not correctly generated, as the X and Y axes require numerical
values. In the example table, the age ranges are not interpreted by
Vispedia as numerical values. To address this issue, I used Vispedia to
generate a different plot whose link was included in a follow-up email
(I also included the link below). In this new plot, the X axis
corresponds to the incidence rates, while the age ranges are used to
label the points.

http://vispedia.stanford.edu/index.php/vis/353/AnyLine#/?cp0=0&f0=Incidence%20(new%20affected)%20%20per%20thousand%20%20person%C3%A2%C2%80%C2%93years&cp1=0&f1=Age&cp2=0&f2=&cp3=0&f3=&cp4=0&f4=

I've tried more sophisticated tables like the one included in the

http://en.wikipedia.org/wiki/Image:Rational_scale_to_assess_the_harm_of_drugs_(mean_physical_harm_and_mean_dependence).svg

I couldn't get Vispedia to work on plotting the drug abuse data listed
in this table. Perhaps I should relay this problem to the Vispedia team.

Cheers,

-Kei

Luciano, Joanne S. wrote:
> This is the Wikipedia data:
>
> Age 	Incidence
> 	(new affected)
> 	per thousand
> 	person–years
> 65–69 	 3
> 70–74 	 6
> 75–79 	 9
> 80–84 	23
> 85–89 	40
> 90–  		69
>
> This is the Vispedia output.  I haven't had time to retrace the
> construction ... or to try it myself, but there appears to be a
> discrepancy.
>
> This may be a browser issue too, I'm in Firefox.
>
> What's coming out is a plot with (x,y) as follows:
>
> (3,3)
> (6, 6)
> (9,9)
> (23,23)
> (40,40)
>
> Here's a paste:
> X
> [ Age ]
>
> Y
> [ Incidence (new affected) per thousand personÃÂ¢ÃÂÃÂyears ]
> Age: 65–69 → Age: 0 → Incidence (new affected) per thousand
> person–years: 3.03.0
>
> Incidence (new affected) per thousand person–years: 3.03.0
> Age: 70–74 → Age: 1 → Incidence (new affected) per thousand
> person–years: 6.06.0
>
> Incidence (new affected) per thousand person–years: 6.06.0
> Age: 75–79 → Age: 2 → Incidence (new affected) per thousand
> person–years: 9.09.0
>
> Incidence (new affected) per thousand person–years: 9.09.0
> Age: 80–84 → Age: 3 → Incidence (new affected) per thousand
> person–years: 23.023.0
>
> Incidence (new affected) per thousand person–years: 23.023.0
> Age: 85–89 → Age: 4 → Incidence (new affected) per thousand
> person–years: 40.040.0
>
> Incidence (new affected) per thousand person–years: 40.040.0
> Age: 90– → Age: 5 → Incidence (new affected) per thousand person–years:
> 69.069.0
>
> Sorry I wasn't able to take the time to provide more answers than
> questions....
>
> Joanne
>
>
>> -----Original Message-----
>> From: public-semweb-lifesci-request@w3.org [mailto:public-semweb-
>> lifesci-request@w3.org] On Behalf Of Kei Cheung
>> Sent: Sunday, November 09, 2008 10:31 PM
>> To: Maged N.K. Boulos
>> Cc: public-semweb-lifesci@w3.org
>> Subject: Re: Vispedia: Interactive Visual Exploration of Wikipedia
>>
> Data
>
>> via Search-Based Integration
>>
>>
>> Hi Maged,
>>
>> Thanks for sharing the paper. It's very interesting work incorporating
>> analysis and visualization into wiki. I also found the following demo
>> video:
>>
>> http://graphics.stanford.edu/projects/vispedia/vispedia-trailer-tr.mov
>>
>> Just for fun, I searched for "Alzheimer" in Wikipedia and it brought
>>
> me
>
>> to the Alzeihmer's disease page. In the Epidemiology section, there is
>>
> a
>
>> table listing AD incidence rates after 65 years of age. I used
>>
> vispedia
>
>> to visualize this table and create the following scatter plot:
>>
>> http://vispedia.stanford.edu/vis/353/Scatterplot#/?cp0=0&f0=Age&cp1=0&
>>
> f1
>
>> =Incidence%20(new%20affected)%20%20per%20thousand%20%20person%C3%A2%C2
>>
> %8
>
>> 0%C2%93years&cp2=0&f2=&cp3=0&f3=&cp4=0&f4=
>>
>> Cheers,
>>
>> -Kei
>>
>> Maged N.K. Boulos wrote:
>>
>>
>>> Given the recent interests of some members of this list in Wiki
>>> applications like WikiNeuron and novel information visualization
>>> techniques, this paper might prove useful and inspiring:
>>> Chan B, Wu L, Talbot J, Cammarano M, Hanrahan P.
>>>
>>>
>> <http://www.ncbi.nlm.nih.gov/pubmed/18988966?ordinalpos=1&itool=Entrez
>>
> Sy
>
>> stem2.PEntrez.Pubmed.Pubmed_ResultsPanel.Pubmed_DefaultReportPanel.Pub
>>
> me
>
>> d_RVDocSum>
>>
>>> *Vispedia: Interactive Visual Exploration of Wikipedia Data via
>>> Search-Based Integration*.
>>> /IEEE Trans Vis Comput Graph/. 2008
>>>
> November-December;14(6):1213-1220.
>
>>> Stanford University.
>>>
>>> Wikipedia is an example of the collaborative, semi-structured data
>>> sets emerging on the Web. These data sets have large, non-uniform
>>> schema that require costly data integration into structured tables
>>> before visualization can begin. We present Vispedia, a Web-based
>>> visualization system that reduces the cost of this data
>>> integration.&#xD;Users can browse Wikipedia, select an interesting
>>> data table, then use a search interface to discover, integrate, and
>>> visualize additional columns of data drawn from multiple Wikipedia
>>> articles. This interaction is supported by a fast path search
>>> algorithm over DBpedia, a semantic graph extracted from Wikipedia's
>>> hyperlink structure. Vispedia can also export the augmented data
>>> tables produced for use in traditional visualization systems. We
>>> believe that these techniques begin to address the "long tail" of
>>> visualization by allowing a wider audience to visualize a broader
>>> class of data. We evaluated this system in a first-use formative lab
>>> study. Study participants were able to quickly create effective
>>> visualizations for a diverse set of domains, performing data
>>> integration as needed.&#xD;
>>>
>>> PMID: 18988966 [PubMed - as supplied by publisher]
>>>
>>>
>
>
```
Received on Monday, 10 November 2008 18:21:37 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 17:20:39 UTC