The most common document formalisation for text classi?cation is the vector space model founded on the bag of words/phrases representation. Some work (see for example [12]) has been done on hybrid representations to capture both structural elements (- ing the graph model) and signi?cant features using the vector model.