hw4 - (b) Compare the algorithms using enron2.txt: i. plot...

Info iconThis preview shows page 1. Sign up to view the full content.

View Full Document Right Arrow Icon
CSE 5800 Mining/Learning and the Internet—HW4 Due Nov 16, Wed, 6:30pm Submit Server: course=cse5800 , project=hw4 1. Implement BridgeCut with four versions: (a) edge with the highest Bridging Centrality ( C Br ( e ) in the paper) (b) vertex with the highest Bridging Centrality ( C Br ( v )) (c) edge with the highest Betweenness (Φ( e )) (d) vertex with the highest Betweenness (Φ( v )) 2. Allow this parameter: (a) density threshold (densityThreshold in the paper) 3. Measure performance using: (a) Davies-Bouldin index (b) Silhouette Coefficient (handout) 4. Use three groups of data sets: (a) toy data sets on the course web site (b) real data sets on the course web site (c) your own data set 5. Disscuss in a report (in pdf): (a) Sensitivity analysis of parameters using enron2.txt: i. vary density threshold ii. calculate each performance measurement, iii. plot performance vs. density threshold iv. discuss the value for density threshold that seems to achieve the highest performance.
Background image of page 1
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: (b) Compare the algorithms using enron2.txt: i. plot performance vs. density threshold for dierent algorithms ii. plot clustering coecient vs. number of nodes deleted (up to top 10%) for dierent algorithms [Figure 5b in the paper] iii. discuss the relative performance of dierent algorithms 6. Implementation: (a) use one of these programming languages: C, C++, Java, or LISP. (b) input le: a le for vertices and edges (c) two modules: i. BridgeCut: input graph; output: top edge/vertex when it is removed for each cluster, output vertices in the cluster ii. Evaluate: input vertices and cluster membership; output performance 7. Submission: (a) source code (b) your data set (c) report in pdf (d) README.txt (how to compile and run your program on code.t.edu)...
View Full Document

This note was uploaded on 02/10/2012 for the course CSE 5800 taught by Professor Staff during the Fall '09 term at FIT.

Ask a homework question - tutors are online