[ 1] Cui Z ifeng, Xu Baowen, ZhangW e ifeng, et a.l W eb do cum en ts cluster ing w ith interest links[ C] / / Serv ice-Or iented System Eng ineer ing. IEEE Internationa lW orkshop, 2005: 111-116.
[ 2] Zeng H uajun, H eQ ica,i Chen Zhen, et a.l Learn ing to c lusterw eb sea rh resu lts[ C] / / Proceed ings o f SIGIR-04. Sheffield,2004: 210-217.
[ 3] Sebastiani F. M ach ine learn ing in au tom ated tex t ca tego rization[ J]. ACM Computing Survey, 2002, 34( 1): 1-47.
[ 4] Joach im s T. Tex t categor ization w ith support vec to rm ach ines: Lea rning w ith m any relevan t fea tures[ C ] / / Proceed ing s o f ECML-98. Chemn itz, 1998: 137-142.
[ 5] Schapire R E, S inger Y. Boo stexter: a boosting-based sy stem for tex t ca tego rization[ J] . M achine Lea rning, 2000, 39( 2 /3):135-168.
[ 6] Lu Yuchang, LuM ingyu, L i Fan. Analysis and construc tion of w ord w e ighing function in VSM [ J] . Journa l o f Computer Research& Deve lopm en t, 2002, 39( 10): 1 205-1 210.
[ 7] Xue X iaob ing, Zhou Zh ihua. Distributional fea tures for tex t categor ization[ C ] / / Pro ceedings o f the 17 th European ConferenceonM ach ine Learn ing ( ECML-06). Berlin: LNAI 4212, 2006: 497-508.
[ 8] Lew is D D. N aive( B ayes) at forty: The independence assum ption in inform ation retriev al[ C ] / / Proceed ings of 10th European Con f onM achine Learn ing. Berlin: Spr inger, 1998: 4-15.
[ 9] SaubanM, Pfahr ing er B. Tex t categor ization using docum ent pro filing [ C ] / / Pro ceedings o f PKDD-2003. B erlin: Springer-Ve rlag, 2003: 411-412.
[ 10] C ravenM, D iPasquo D, Fre itag D, et a.l Lea rning to ex trac t sym bo lic know ledg e from theW or ldW ideW eb[ C] / / Proceeding s o fAAA I-98. M ad ison: W I, 1998: 509-516.