Cluster by rows or columns?
Euclidean distance.
Perform hierarchical clustering. matrix must be rectangular and represents the data matrix. distance is the distance metric, which must be a function that accepts two equal-length input ranges of doubles. linkage is the linkage function, which must accept a double[] that represents all possible pairwise distances between two clusters and return a summary of these distances.
Used for mean linkage.
A tree for defining hierarchical clusters.
Not very efficient, though it probably doesn't need to be because the use case is visualizations, and all the information has to fit reasonably on the visualization. Therefore, N will always be fairly small.
Copyright (C) 2011 David Simcha
Boost Software License - Version 1.0 - August 17th, 2003
Permission is hereby granted, free of charge, to any person or organization obtaining a copy of the software and accompanying documentation covered by this license (the "Software") to use, reproduce, display, distribute, execute, and transmit the Software, and to prepare derivative works of the Software, and to permit third-parties to whom the Software is furnished to do so, all subject to the following:
The copyright notices in the Software and this entire statement, including the above license grant, this restriction and the following disclaimer, must be included in all copies of the Software, in whole or in part, and all derivative works of the Software, unless such copies or derivative works are solely in the form of machine-executable object code generated by a source language processor.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, TITLE AND NON-INFRINGEMENT. IN NO EVENT SHALL THE COPYRIGHT HOLDERS OR ANYONE DISTRIBUTING THE SOFTWARE BE LIABLE FOR ANY DAMAGES OR OTHER LIABILITY, WHETHER IN CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
This file contains functions for performing hierarchical clustering, and can be used for drawing heatmaps and, eventually, dendrograms.