IBM®
Skip to main content
    United States change      Terms of use
 
 
Select a scope:    
     Home      Products      Services & industry solutions      Support & downloads      My account     
alphaWorks  >  Information management  >  

IBM Frequent Subgraph Miner

A data-mining tool for discovering all frequent substructure patterns from a set of labeled graphs.


Date Posted: February 27, 2004
Overview Requirements DownloadFAQsForum Reviews

1. How do I analyze data?
2. How do I find frequently-occurring patterns?
3. What is code?


1. How do I analyze data?

IBM Frequent Subgraph Miner can analyze a set of undirected graphs that do not contain self-looped vertices. The format of input file is as follows: size(x,y)
    x: graph identifier starting with 1.
    y: size, which means the number of vertices in the graph.
name(x,y)
    x: graph identifier starting with 1.
    y: name of the graph
node(x,y,z)
    x: graph identifier starting with 1.
    y: vertex identifier starting with 1.
    z: vertex labels
link(x,y1,y2,z)
    x: graph identifier starting with 1.
    y1: vertex identifier starting with 1.
    y2: vertex identifier starting with 1.
    z: edge label of an edge between vertices y1 and y2.
Back to top Back to top

2. How do I find frequently-occurring patterns?

Add the ?out option. All discovered patterns are put out as follows when the sample data is used:
    size = 4
    VertexLabels = c22,c22,h3,h3
    code = r,s,0,0,s,0
    count = 189
It means that one of the discovered patterns has 4 vertices, 189 graphs contain this pattern, and its structure is represented by the following adjacency matrix:
    c22 c22 h3 h3
    c22| 0 r s 0 |
    c22| r 0 0 s |
    h3 | s 0 0 0 |
    h3 | 0 s 0 0 |
Back to top Back to top

3. What is code?

In order to reduce the candidates of the frequent subgraphs, the code of an adjacency matrix is defined as follows: The code of an adjacency matrix Xk whose size is k x k is defined as
    code(Xk)=x1,2x1,3x2,3x1,4...cxk-2,kxk-1,k
by using (i,j)-element xi,j of Xk.
Back to top Back to top
Download now Download now

Related technologies

For platform(s):
Linux

For topics:
data mining, frequent pattern


Related resources

Press Articles

 

    About IBM Privacy Contact