Home > Analyzing Data > 5 Things you can learn about analysis from the Intelligence Community

5 Things you can learn about analysis from the Intelligence Community

December 4th, 2009 admin Leave a comment Go to comments


Abstract
Businesses have a large and growing need to analyze data. This is no easy task today with the exploding volumes of data pouring in from everywhere, and the enormous pressure to turn these mountains of data into information that can be acted upon quickly.

It is no surprise that organizations spend over $15B annually on Business Intelligence (BI) and Data Mining technologies. But with all of the focus on infrastructure technologies, there is little emphasis on the art of analysis (analytics).

This is an area where the private sector would be well served by studying the methods used by the US Intelligence Community. This community has been in the business of understanding massive amounts of data for a long time and the applications are as mission critical as they get.

So, what are the lessons that you can apply to your business today. This multi-part blog series will explore 5 specific areas in more detail.

Part 1. The Power of Link Analysis

Business Intelligence solutions often make use of charts and graphs to communicate information. We think visually and pictures, or information visualizations, can be highly effective at communicating interesting insights to us quickly.

Recognizing this, newer solutions are augmenting their arsenals of charts with new visualizations that are more dynamic and specialized. For example, in addition to standard bar and pie charts, these include heat maps, bubble charts and timelines.

However, one particular visualization that the law enforcement community has used for a long time has been notably absent from mainstream analytics products. This visualization is a Relationship Graph (also called node-and-link diagrams). Relationship Graphs fall under the science of Link Analysis which is used to discover and understand relationships between seemingly unrelated entities. Increasingly, this is becoming an important exercise for businesses in everything from fraud identification to customer and market basket analysis.

The reason that this area, relationship analysis, is getting so much attention is that our information landscape is getting more dynamic by the day. Most analysis tools require you to know what questions to ask in advance. For example, ”What is my revenue by region?” or “How many customers do I have?” However, as soon as you want to explore and navigate through the mountains of information at your disposal, the tool falls short. Yet this is precisely what businesses must do today – Discover the unknown, reveal those insights that provide a competitive advantage.

Again, this is not a new challenge for the intel community. They are routinely presented with massive amounts of data and a charge to discover the non-obvious connections. For example, imagine looking at reams of tabular data relating to flight and housing records of foreign visitors to the country. Now, look at the same data in a relationship graph (Figure 1).

Figure 1 - Link Analysis Diagram of Foreign Visitors

Figure 1 - Link Analysis Diagram of Foreign Visitors

Here, an observation ‘jumps out’ at you; multiple people, coming in on different flights and going to the same address. The human brain is an unprecedented pattern recognition engine and when we identify patterns, we tend to draw inferences almost instantly. This is typical of link analysis. When done well, the resulting insights can be remarkable.

So, how could we apply something like that to a more typical business scenario?

Let’s take a simple example of analyzing retail data relating to sales promotions (gift cards, etc.) that are currently being run. A common chart here is margin by product line (Figure 2); this is the question that I know to ask in advance. But I see here that I am losing money on Computer Games in the Eastern region. This is odd because I would expect this to be a profitable product category.

Figure 2 - CRM Example of Margin by Product Category.

Figure 2 - CRM Example of Margin by Product Category.

So now the question is, why are we losing money here? To explore this in more detail, let’s look at a relationship graph relating to these margin-eroding transactions. Let’s specifically look at a relationship graph that highlights the actual customers along with the promotion that they are using to make the purchases and the state that they live in (Figure 3).

Figure 3

Figure 3

Now, while this graph is starting to get a little busy, where is your eye drawn right away? Note all of the activity relating to the 2 highlighted individuals. Now notice that they both are making all of these purchases on the new Gift Card promotion and they are also both from Massachusetts.

So, we have a couple of individuals that are making high volumes of purchases using these gift cards that I have been issuing. Remember that I didn’t know that I was looking for this in advance. Indeed, this would be a difficult insight to reveal with traditional reporting unless I had a specific query pre-defined that called it out.

But with this particular visualization, the relationship jumps out at you; the pattern is detected. In this case, I have revealed potential fraud activity that would explain why I am losing money on an otherwise profitable product category in one particular geography.

Highly interactive relationship graphs would allow us even more flexibility. For example, if every node, and link, is interactive, we could envision ‘drilling down’ or ‘drilling out’ to extend the analysis even further. The key theme here is being able to proceed with analysis at the speed of the human brain. Rather than having a pre-defined set of questions to ask, allowing the analyst to explore the data and let the resulting insights drive where they go next.

Increasing amounts of data coupled with shrinking time windows to understand and act on the information it contains are driving businesses to seek new approaches to analytics. The ability to express relationships in a visual way is extremely powerful. The IC has been using link analysis for years and it is no surprise that the secret is getting out to the commercial BI world.

Stop back and visit us for part 2 of this series titled “Shift Your Lens”.

  1. Mitch Bell
    December 14th, 2009 at 17:57 | #1

    As someone who uses data visualization this seems quite powerful and different from other solutions on the market. Link analysis can be very useful – showing links between people and the products they buy is one example. There also seem to be many applications in commercial markets that are “investigative” in nature such as fraud and cyber security analysis to look at linkages between people, events, alerts, transactions, account access and more. On the cyber and network security side, I see a variety of applications from network traffic analysis to account access analysis. On-line advertising analysis to identify click fraud also comes to mind. Knowing the relationships between source and destination IP addresses might lead the analysts to timeline analysis which might show unusual account access behavior. I would think geographic location followed by linkages to third party data could also be applied add contact data to the analysis. The idea that I could explore data interactively leads me to believe I could uncover something in my data that I previously did not know.

  2. Marshall W
    December 15th, 2009 at 19:28 | #2

    We try to deliver meaningful data visualization to the appropriate user. The fact that this has link analysis seems quite different than tradional approaches….. The ability to show links between people and products, other people, phone numbers, demographics, employers and other entities is quite useful. This type of visualization can show me relationsthips that I would not have known or discovered through charting or mapping. Most valuable are the links to unrelated data.

  3. Mohammed Aejaz
    December 28th, 2009 at 14:01 | #3

    The new era of “Personalized Medicine” has begun. “Personalized Medicine” is nothing but “individualization” of treatment. This “individualization” is not possible without complete “visualization” of the data – any data — patient’s data — data that pours out from different sources and resource about a subject –subject being patient or plant, biological or otherwise. “Visualization” of the data is the key in making the “Links” between the “Nodes” that our “eyes” catch while “Visualizing” the data.

    hus we enter into the realm of “Link Analysis” In Medicine “ Symptoms” are nothing but the “Nodes” that are “Linked” via unseemingly relationship to the “ Organic enteties. This “Link Analysis” will make diagnosis, prognosis and treatment easier, less expensive and faster.

    As the world moves more towards “personalization” and “Individualization” I predict a rapid growth in the field of “ Link Analysis”. Without “Link Analysis” “ Data Explosion” will soon become a burden.
    I dare to say that we will be entering in the new filed of (MI ) Medical Intelligence via “Link Analysis”.

    Mohammed Aejaz

  4. John
    January 12th, 2010 at 10:04 | #4

    Interesting. If we have not been doing link analysis do we first need to restructure our data in a complex way to get it to work? Is it possible to get a quick view of how link analysis will work on our data? Our domain is investigating how content is used by enterprise sales people and tracking back what is most useful to who and what was authored.

  5. admin
    January 12th, 2010 at 10:44 | #5

    Link Analysis diagrams are very easy to build using structured data. This includes text data. If you have data on content accessed, author, sales person, type of content, you can use Centrifuge to build a link analysis diagram. If the data is unstructured text, there are tools available through our partners to structure this information. Yes, we do offer a way for you to evaluate the technology for free and use your own data. If you click on TRY IT TODAY anywhere on our site, you can submit a short form and get access to the technology. We are also happy to develop a few link analysis diagrams for you if you would like to see own data in Centrifuge.

  6. August 25th, 2010 at 07:03 | #6

    Thanks for your comment Jim. Centrifuge 2.0 (server) is available this week. It does include “flexible workspaces” allowing you to include multiple data visualizations in dashboards at one time. Also part of of this release are enhancements to relationshiip graph. Now analysts can scale the size of nodes and links based on measures that are part of the dataview. Annotation capabilities have been added to all visualuzations. This provides users with the ability to highlight key findings and summarize results for presentation purposes. In 2.0, each of the visualizations is now a “widget” allowing you to embed the visualization inside of your own applications and HTML pages. We will reach out to show you a demo. Finally, as the underlying data changes, the dataview can be updated to reflect changes in the data. There are other highlights of this release we would like to share with you. I’ll post a few screen shots fo you to see now.

  1. No trackbacks yet.
Top