Tag Cloud Big Data
Pointers for Easy Navigation
460Data: Recursively walks a directory of files, processes the data, and stores it in a hash table and a min heap. The checkWord function does the filtering.
Within the heap class, insert does most of the work of keeping the heap property intact.
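The 460Data flow described above can be sketched roughly as follows. This is a minimal illustration in Python, not the project's C# code: check_word only mirrors the role of checkWord, and its rules (minimum length, the STOP_WORDS list), the count_words helper, and the (count, word) heap entries are all assumptions made for the example.

```python
import os
import re

STOP_WORDS = {"the", "and", "for"}  # assumed filter list, not taken from the project


def check_word(word):
    """Hypothetical filter: keep words of 3+ letters that are not stop words."""
    return len(word) >= 3 and word not in STOP_WORDS


def count_words(root):
    """Walk root recursively and count accepted words in a dict (the hash table)."""
    counts = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            with open(path, encoding="utf-8", errors="ignore") as handle:
                for word in re.findall(r"[A-Za-z]+", handle.read().lower()):
                    if check_word(word):
                        counts[word] = counts.get(word, 0) + 1
    return counts


class MinHeap:
    """Array-backed binary min heap; entries here are (count, word) pairs."""

    def __init__(self):
        self.items = []

    def insert(self, entry):
        # Append at the end, then sift up until the parent is no larger.
        self.items.append(entry)
        i = len(self.items) - 1
        while i > 0:
            parent = (i - 1) // 2
            if self.items[parent] <= self.items[i]:
                break
            self.items[parent], self.items[i] = self.items[i], self.items[parent]
            i = parent
```

After counting, each (count, word) pair from the dict could be passed to insert, which leaves the smallest entry at items[0].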
TagCloud5: btGenerate_Click is the main event handler that calls most of the other functions to generate tags and create genomes for the different generations.
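As a rough picture of the generate-and-mutate flow that such a handler drives, here is a small Python sketch (the project itself is C#). Everything in it is an assumption for illustration: a genome is taken to be an ordering of the words, mutation is a swap of two positions, and the fitness function is supplied by the caller because the project's own fitness measure is not described here.

```python
import random


def mutate(genome, rng):
    """Return a new genome with two positions swapped; the parent is unchanged."""
    child = list(genome)
    if len(child) >= 2:
        i, j = rng.sample(range(len(child)), 2)
        child[i], child[j] = child[j], child[i]
    return child


def best_of_two_generations(words, fitness, population_size, rng):
    """Build two generations and return the genome with the highest fitness."""
    # Generation 1: random orderings of the word list.
    first = [rng.sample(words, len(words)) for _ in range(population_size)]
    # Generation 2: one mutated child per first-generation genome.
    second = [mutate(genome, rng) for genome in first]
    return max(first + second, key=fitness)
```

A caller would pass its own scoring function, for example one that rewards compact layouts, together with a random.Random instance so that runs can be reproduced with a fixed seed.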
Download links without ZIP are single .cs files; ZIP links contain the entire project.
Processed and Organized Data Extracted from HTML files: 460Data
ZIP: 460Data ZIP
First Take of Tag Cloud Generator (randomizes x and y until it finds a free spot, or shrinks all word sizes once enough tries have occurred): TagCloud1
ZIP: TagCloud1 ZIP
Second Take of Tag Cloud Generator (randomizes the array content for each genome and randomizes y, then pushes right, then pushes down until collision); the Fitness and Genome functions are flawed: TagCloud2
ZIP: TagCloud2 ZIP
Third Version, which finds clumps properly: TagCloud3
ZIP: TagCloud3 ZIP
Fourth Version, which has a proper mutation algorithm that creates a new (mutated) genome from a former genome: TagCloud4
ZIP: TagCloud4 ZIP
Fifth and Final Version, which mutates the genomes and displays the fittest genome of the first two generations. It also has a UI: TagCloud5
ZIP: TagCloud5 ZIP
The PPT Presentation: Capstone Presentation
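For readers who want a feel for the first take (TagCloud1) before downloading it, the random-placement idea can be sketched as below. This is a Python illustration under stated assumptions, not the project's C# code: words are reduced to (width, height) boxes, and the names place_words, max_tries, and shrink, along with the choice to restart placement after each shrink, are made up for the example.

```python
import random


def overlaps(a, b):
    """True if axis-aligned rectangles a and b, given as (x, y, w, h), intersect."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    return ax < bx + bw and bx < ax + aw and ay < by + bh and by < ay + ah


def place_words(sizes, canvas_w, canvas_h, max_tries=200, shrink=0.9, rng=None):
    """Place every (w, h) box at a random free spot, shrinking all boxes on failure."""
    rng = rng or random.Random()
    scale = 1.0
    while True:
        placed = []
        for w, h in sizes:
            w, h = w * scale, h * scale
            if w > canvas_w or h > canvas_h:
                break  # box cannot fit at this scale
            for _ in range(max_tries):
                x = rng.uniform(0, canvas_w - w)
                y = rng.uniform(0, canvas_h - h)
                box = (x, y, w, h)
                if not any(overlaps(box, other) for other in placed):
                    placed.append(box)
                    break
            else:
                break  # this word ran out of tries
        if len(placed) == len(sizes):
            return placed, scale
        scale *= shrink  # shrink every word and start over
```

The function returns the placed boxes together with the final scale, so a caller can see how much the words had to shrink to fit.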