<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.3.2" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>
<channel>
	<title>Comments on: VisuMap</title>
	<link>http://voyagememoirs.com/pharmine/2008/07/22/visumap/</link>
	<description>Data mining in Pharmacy</description>
	<pubDate>Sun, 20 May 2012 14:16:39 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.3.2</generator>
		<item>
		<title>By: Richard</title>
		<link>http://voyagememoirs.com/pharmine/2008/07/22/visumap/#comment-426</link>
		<dc:creator>Richard</dc:creator>
		<pubDate>Tue, 14 Oct 2008 16:21:34 +0000</pubDate>
		<guid>http://voyagememoirs.com/pharmine/2008/07/22/visumap/#comment-426</guid>
		<description>James,

How can I contact you ?  Your server/site bounces and won't redirect email.</description>
		<content:encoded><![CDATA[<p>James,</p>
<p>How can I contact you ?  Your server/site bounces and won&#8217;t redirect email.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James X. Li</title>
		<link>http://voyagememoirs.com/pharmine/2008/07/22/visumap/#comment-268</link>
		<dc:creator>James X. Li</dc:creator>
		<pubDate>Tue, 26 Aug 2008 02:54:08 +0000</pubDate>
		<guid>http://voyagememoirs.com/pharmine/2008/07/22/visumap/#comment-268</guid>
		<description>It should be noticed that, apart from mapping algorithms, the distance metric
you choose to measure the dissimilarity between data points also plays an very important role in these kind of analysis. If you are using the fingerprints vectors which, i suppose, are binary feature flags, I would suggest to try other metrics like Jaccard or Dice distance. 

VisuMap allows you to plug-in your own distance functions to characterize dissimilarities. For those standard metrics, like jaccard distance, there are free ready-to-use plugin modules.</description>
		<content:encoded><![CDATA[<p>It should be noticed that, apart from mapping algorithms, the distance metric<br />
you choose to measure the dissimilarity between data points also plays an very important role in these kind of analysis. If you are using the fingerprints vectors which, i suppose, are binary feature flags, I would suggest to try other metrics like Jaccard or Dice distance. </p>
<p>VisuMap allows you to plug-in your own distance functions to characterize dissimilarities. For those standard metrics, like jaccard distance, there are free ready-to-use plugin modules.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Yap Chun Wei</title>
		<link>http://voyagememoirs.com/pharmine/2008/07/22/visumap/#comment-198</link>
		<dc:creator>Yap Chun Wei</dc:creator>
		<pubDate>Fri, 25 Jul 2008 02:10:10 +0000</pubDate>
		<guid>http://voyagememoirs.com/pharmine/2008/07/22/visumap/#comment-198</guid>
		<description>Hi Krishnakumari,

The size of the dataset used is only 171 compounds.


Hi James,

Thanks for the clarification. Yes, I am aware that VisuMap requires DirectX library but since my machines are all linux-based, I can only run it using Windows that is under VMWare. I had actually also asked my graduate student to try the software and she mentioned that the 3D navigation control is rather good. 

I will be looking more into the clustering algorithms next. However, as I had mentioned in my previous post, I am not an expert in visualization software. I am still learning how to best utilize such software for research, in particular in QSAR research. Thus I hope readers don't really treat these posts as reviews on the software but rather treat them as just personal observations of an amateur using the software. I will welcome comments from readers and yourself on how to more effectively use such software and correctly any inaccuracies that may inadvertently arise.</description>
		<content:encoded><![CDATA[<p>Hi Krishnakumari,</p>
<p>The size of the dataset used is only 171 compounds.</p>
<p>Hi James,</p>
<p>Thanks for the clarification. Yes, I am aware that VisuMap requires DirectX library but since my machines are all linux-based, I can only run it using Windows that is under VMWare. I had actually also asked my graduate student to try the software and she mentioned that the 3D navigation control is rather good. </p>
<p>I will be looking more into the clustering algorithms next. However, as I had mentioned in my previous post, I am not an expert in visualization software. I am still learning how to best utilize such software for research, in particular in QSAR research. Thus I hope readers don&#8217;t really treat these posts as reviews on the software but rather treat them as just personal observations of an amateur using the software. I will welcome comments from readers and yourself on how to more effectively use such software and correctly any inaccuracies that may inadvertently arise.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James</title>
		<link>http://voyagememoirs.com/pharmine/2008/07/22/visumap/#comment-197</link>
		<dc:creator>James</dc:creator>
		<pubDate>Wed, 23 Jul 2008 15:55:58 +0000</pubDate>
		<guid>http://voyagememoirs.com/pharmine/2008/07/22/visumap/#comment-197</guid>
		<description>Thanks for reviewing our software. The 3D animation service in VisuMap
requires DirectX library as documented in the installation guide. 
The navigation of the 3D maps is very similar to that 
of the PCA window, except that it is much faster for large datasets 
(&#62;5K data points). The 3D navigation interface 
is modeled like GoogleEarth, so that you can virtually 
fly within your data using your mouse.

For most mapping algorithms in VisuMap the dataset size is limited
to 5000 to 10000 data points. If you have more data points, you 
should use one of the integrated clustering algorithms to 
reduced dataset size to a more manageable size. For instance, 
you can easily reduce a dataset with 1 million data point to 
few thousands clusters with the self-organizing map within few hours.

You can also use the clustering services to color data points
automatically according the their clusters

It should also be pointed out that mapping algorithms like Sammon map and PCA emphasize on the global inter-cluster structure, whereas other mapping algorithms (like the RPM and CCA) emphasize more on the details within clusters.</description>
		<content:encoded><![CDATA[<p>Thanks for reviewing our software. The 3D animation service in VisuMap<br />
requires DirectX library as documented in the installation guide.<br />
The navigation of the 3D maps is very similar to that<br />
of the PCA window, except that it is much faster for large datasets<br />
(&gt;5K data points). The 3D navigation interface<br />
is modeled like GoogleEarth, so that you can virtually<br />
fly within your data using your mouse.</p>
<p>For most mapping algorithms in VisuMap the dataset size is limited<br />
to 5000 to 10000 data points. If you have more data points, you<br />
should use one of the integrated clustering algorithms to<br />
reduced dataset size to a more manageable size. For instance,<br />
you can easily reduce a dataset with 1 million data point to<br />
few thousands clusters with the self-organizing map within few hours.</p>
<p>You can also use the clustering services to color data points<br />
automatically according the their clusters</p>
<p>It should also be pointed out that mapping algorithms like Sammon map and PCA emphasize on the global inter-cluster structure, whereas other mapping algorithms (like the RPM and CCA) emphasize more on the details within clusters.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: krishnakumari</title>
		<link>http://voyagememoirs.com/pharmine/2008/07/22/visumap/#comment-195</link>
		<dc:creator>krishnakumari</dc:creator>
		<pubDate>Tue, 22 Jul 2008 08:53:37 +0000</pubDate>
		<guid>http://voyagememoirs.com/pharmine/2008/07/22/visumap/#comment-195</guid>
		<description>You said you have worked on 1025 dimensions. What about the amount of data you have taken? (1 million  /  0.5 million etc)
i.e datasize you have considered?</description>
		<content:encoded><![CDATA[<p>You said you have worked on 1025 dimensions. What about the amount of data you have taken? (1 million  /  0.5 million etc)<br />
i.e datasize you have considered?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

