Blogs study may net credible information |
by William J. Sharp
Air Force Office of Scientific Research Public Affairs
6/29/2006 - ARLINGTON, Va. (AFPN) -- The Air Force Office of Scientific Research has begun funding a new research area that includes a study of blogs.
Blog research may provide information analysts and warfighters with invaluable help in fighting the war on terrorism.
Drs. Brian Ulicny, senior scientist, and Mieczyslaw Kokar, president, Versatile Information Systems Inc., Framingham, Mass., will receive approximately $450,000 in funding for the three-year project titled, "Automated Ontologically-Based Link Analysis of International Web Logs for the Timely Discovery of Relevant and Credible Information."
"It can be challenging (for information analysts) to tell what's important in blogs unless you analyze patterns," Dr. Ulicny said.
Patterns include the content of the blogs as well as what hyperlinks are contained within the blog. Within blogs, hyperlinks act like reference citations in research papers, allowing someone to discover the most important events bloggers are writing about. This is the same way a person can find the most important papers in a field by finding which ones are cited most often in research papers. This type of analysis can help information analysts' searches be as productive as possible.
The blog study is part of AFOSR's new Information Forensics and Process Integration research program at Syracuse University in New York. The new portfolio of projects consists of three areas of research emphasis: incomplete information and metrics; search, interactive design and active querying; and cognitive processing.
One of the problems analysts may have with blog monitoring, Dr. Ulicny said, is there is too much actionable information for the analyst to properly analyze.
"We are developing an automated tool to tell analysts what bloggers are most interested in at a point in time," Dr. Ulicny said.
This analysis, Dr. Kokar said, is based on what his company calls the RSTC approach to blog analysis: relevance, specificity, timeliness and credibility. RSTC helps information analysts filter the most important information to study.
"Relevance involves developing a point of focus and information related to a particular focus," Dr. Kokar said.
Timeliness has to do with immediacy -- how important a topic is now.
"Credibility," he said, "is the amount of trust you have in an information source."
Finally, specificity can provide value to information analysts depending on how general or specific they need the information to be.
In some ways, the team's automated project works like a search engine but with a more focused approach. Traditional search engines present users with information based on, for example, the number of times a term appears in a document. The information obtained via a search engine query tends to be similar among the documents returned. Blog postings, however, can be much more dissimilar from one to another.
"What we're doing is a sort of information retrieval," Dr. Ulicny said. "The difference is that in order to find and analyze blog entries, you need to more adequately model how the blogs work on a global scale."
To some degree blog interpretation involves understanding a different form of communication, he said,
"Blog entries have a different structure," Dr. Ulicny said. "They are typically short and are about something external to the blog posting itself, such as a news event. It's not uncommon for a blogger to simply state, ‘I can't believe this happened,' and then link to a news story."
In this example, Dr. Ulicny said, there might not be much of interest in the blog posting, yet the fact that the blogger called attention to this story can be significant to understanding what matters. A good example, he said, is the recent furor in the Muslim world over the publication of cartoons of Mohammad in a Danish newspaper. The original publication wasn't much noticed in the West, but bloggers discussed this event and that possibly contributed to riots worldwide.
"The fact that the Web is a vast source of information is sometimes overlooked by military analysts," Dr. Kokar said. "Our research goal is to provide the warfighter with a kind of information radar to better understand the information battlespace."
Two key concepts to remember about the program are "actionable information" and the "network effect," said Maj. (Dr.) Amy Magnus of AFOSR's Mathematics and Information Sciences Directorate and the Information Forensics and Process Integration program manager.
Actionable information is information associated with consequence such as evidence of a crime or an enemy's attack plan discovered before its execution. Information forensics is both the identification and authoritative communication of actionable information.
The network effect ensures that as new users, information, and services are added, network utility increases. Process integration seeks to achieve the network effect by promoting processes that share resources and user knowledge.
Information forensics and process integration are important study areas for the military due to the growing emphasis on networked operations. The military is a complex culture "where we collect more data than can be efficiently processed," Major Magnus said.
"With net-centric communications, we may soon have the ability to ask more questions than we can answer," Major Magnus said. "Our goal is to evoke a discipline -- both in the scientific and social sense -- where good, insightful questions direct timely data collections and processing."