Friday, February 15, 2008

Blog Digger

So, as I have mentioned again and again, our Final Year Project is based on blogs.
It's basically a combination of three features:
Most Cited Topics
Opinion Retriever
Summarizer

We are applying, and will continue to apply, data mining and NLP (Natural Language Processing) techniques to perform the above three tasks.
We have collected quite a few blog posts through RSS feeds and are still in the process of collecting more. These posts are our data set and will be the input to our system.
We are collecting them using a program called RSS Feeder by Omar Al Zabir, which collects posts and stores them in an Access database. It has a perfect user interface, but what we need is the database.

By the end, we intend our system to be able to perform the following (this line has been used frequently in our documents):

Most Cited Topics:
If the user states a time period, Blog Digger will search for and display the hot topics of that time period.
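
To give a rough idea of what this could look like, here is a minimal sketch in Python. It is not our actual implementation: the (date, text) tuples standing in for database rows, the tiny stop-word list, and the function name `most_cited_topics` are all made up for illustration. It simply counts how many posts in the given period mention each word and treats the most widely mentioned words as the hot topics.

```python
from collections import Counter
from datetime import date
import re

# Hypothetical stop-word list; a real system would need a much larger one.
STOP_WORDS = {"the", "a", "an", "is", "of", "and", "to", "in", "on", "for", "about"}

def most_cited_topics(posts, start, end, top_n=3):
    """Return the top_n (word, post_count) pairs for posts dated in [start, end].

    `posts` is a list of (date, text) tuples, an assumed stand-in for rows
    read from the collected-posts database.
    """
    counts = Counter()
    for posted_on, text in posts:
        if start <= posted_on <= end:
            # Count each word once per post, so one long post cannot dominate.
            words = set(re.findall(r"[a-z']+", text.lower()))
            counts.update(words - STOP_WORDS)
    # Sort by descending count, then alphabetically so ties are deterministic.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]

# Made-up sample posts, for illustration only.
sample_posts = [
    (date(2008, 2, 1), "The election is on everyone's mind"),
    (date(2008, 2, 3), "Election coverage and the economy"),
    (date(2008, 2, 10), "Thoughts about the economy and the election"),
    (date(2008, 3, 1), "A new phone is out"),
]

hot = most_cited_topics(sample_posts, date(2008, 2, 1), date(2008, 2, 29), top_n=2)
print(hot)
```

Counting single words is the crudest possible notion of a "topic"; grouping related words or phrases would be the job of the data mining and NLP techniques mentioned above.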

Opinion Retriever:
The user specifies a topic, and the opinion of bloggers on that topic is displayed as percentages of positive and negative opinions.
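
As an illustration of the kind of output we have in mind, here is a small Python sketch. Everything in it is an assumption made for the example: the two tiny word lists, the word-counting way of labelling a post, and the names `classify` and `opinion_percentages`. The real system may well label posts in a different way; the sketch only shows how per-post labels turn into percentages.

```python
import re

# Tiny hypothetical word lists, for illustration only.
POSITIVE = {"good", "great", "love", "excellent", "like"}
NEGATIVE = {"bad", "terrible", "hate", "awful", "poor"}

def classify(text):
    """Label a post by comparing its counts of positive and negative words."""
    words = re.findall(r"[a-z']+", text.lower())
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

def opinion_percentages(posts, topic):
    """Return the share of positive and negative posts that mention `topic`.

    Posts with no clear opinion are left out of the percentages.
    """
    relevant = [p for p in posts if topic.lower() in p.lower()]
    labels = [classify(p) for p in relevant]
    opinionated = [label for label in labels if label != "neutral"]
    if not opinionated:
        return {"positive": 0.0, "negative": 0.0}
    positive = 100.0 * opinionated.count("positive") / len(opinionated)
    return {"positive": positive, "negative": 100.0 - positive}

# Made-up sample posts, for illustration only.
sample_posts = [
    "I love the new phone, it is great",
    "The phone is terrible",
    "Good phone overall",
    "Excellent camera on this phone",
    "I hate the weather",
    "The phone arrived today",
]

result = opinion_percentages(sample_posts, "phone")
print(result)
```

A word list like this cannot handle negation ("not good") or sarcasm, which is exactly why the task calls for proper NLP techniques.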

Summarizer:
A summary of the selected blog posts is displayed.
