Abstract:
The web is a vast source of information. However, due to the divisiveness and unlikeness of web pages' contents, this information is covered in the chaotic structure of the World Wide Web. At the same time, with the spread of web access, search engines are being, if not the sole utility, one of the most mechanisms used by the increasing number of users, to find interesting information. We are interested in identifying how pieces of information represented by URL pages, sharing common topics, are related as they are represented on the web. One such problem is studying patterns of occurrences of gathered linked pages named communities and their structures in the web. We call this the web design structure problem. Trying to identify different structures of newly emerging communities materializes the structures problem: communities that have little or no dense representation. Moreover, we will try to find shapes that communities can take. This case study analysis is based on graph-theory and some optimization approaches, which will help in the algorithmic engineering necessary for description of simple, iterative, and option algorithms to find community structures. Also, it will include implementations of these algorithms based on a data source extracted manually from the web, an analysis of the results obtained and some ideas for further work.
Description:
M.S. -- Faculty of Natural and Applied Sciences, Notre Dame University, Louaize, 2001; "A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science"; Includes bibliographical references (leaves 50-51).