HashTrie:a space-efficient multiple string matching algorithm

The famous multiple string matching algorithm AC consumed huge memory when the string signatures were massive,thus unable to process high speed network traffic efficiently.To solve this problem,a space-efficient multiple string matching algorithm-HashTrie was proposed.This algorithm adopted recursiv...

Full description

Saved in:
Bibliographic Details
Main Authors: Ping ZHANG, Yan-bing LIU, Jing YU, Jian-long TAN
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2015-10-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2015215/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841539566692990976
author Ping ZHANG
Yan-bing LIU
Jing YU
Jian-long TAN
author_facet Ping ZHANG
Yan-bing LIU
Jing YU
Jian-long TAN
author_sort Ping ZHANG
collection DOAJ
description The famous multiple string matching algorithm AC consumed huge memory when the string signatures were massive,thus unable to process high speed network traffic efficiently.To solve this problem,a space-efficient multiple string matching algorithm-HashTrie was proposed.This algorithm adopted recursive hash function to store the patterns in bit-vectors in place of the state transition table in order to reduce space consumption.Further more it made use of the rank operation for fast verification.Theoretic analysis shows that the space complexity of HashTrie is O(|P|),which is linear with the size of pattern set |P|and is independent of the alphabetsize σ.The space complexity is superior to the complexity O(|P|σlog|P|)of AC.Experiments on synthetic datasets and real-world datasets(such as Snort,ClamAV and URL)show that HashTrie saves up to 99.6% storage cost compared with AC,and in the meanwhile it runs at a matching speed that is about half of AC.HashTrie is a space-efficient multiple string matching algorithm that is appropriate to search large scale pattern strings with short lengths.
format Article
id doaj-art-3df6993c9c0e42dfb5f7f3c753ade622
institution Kabale University
issn 1000-436X
language zho
publishDate 2015-10-01
publisher Editorial Department of Journal on Communications
record_format Article
series Tongxin xuebao
spelling doaj-art-3df6993c9c0e42dfb5f7f3c753ade6222025-01-14T06:53:55ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2015-10-013617218059696478HashTrie:a space-efficient multiple string matching algorithmPing ZHANGYan-bing LIUJing YUJian-long TANThe famous multiple string matching algorithm AC consumed huge memory when the string signatures were massive,thus unable to process high speed network traffic efficiently.To solve this problem,a space-efficient multiple string matching algorithm-HashTrie was proposed.This algorithm adopted recursive hash function to store the patterns in bit-vectors in place of the state transition table in order to reduce space consumption.Further more it made use of the rank operation for fast verification.Theoretic analysis shows that the space complexity of HashTrie is O(|P|),which is linear with the size of pattern set |P|and is independent of the alphabetsize σ.The space complexity is superior to the complexity O(|P|σlog|P|)of AC.Experiments on synthetic datasets and real-world datasets(such as Snort,ClamAV and URL)show that HashTrie saves up to 99.6% storage cost compared with AC,and in the meanwhile it runs at a matching speed that is about half of AC.HashTrie is a space-efficient multiple string matching algorithm that is appropriate to search large scale pattern strings with short lengths.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2015215/intrusion detectionmultiple string matchingbit-vector recursive hash functionspace-efficient
spellingShingle Ping ZHANG
Yan-bing LIU
Jing YU
Jian-long TAN
HashTrie:a space-efficient multiple string matching algorithm
Tongxin xuebao
intrusion detection
multiple string matching
bit-vector
recursive hash function
space-efficient
title HashTrie:a space-efficient multiple string matching algorithm
title_full HashTrie:a space-efficient multiple string matching algorithm
title_fullStr HashTrie:a space-efficient multiple string matching algorithm
title_full_unstemmed HashTrie:a space-efficient multiple string matching algorithm
title_short HashTrie:a space-efficient multiple string matching algorithm
title_sort hashtrie a space efficient multiple string matching algorithm
topic intrusion detection
multiple string matching
bit-vector
recursive hash function
space-efficient
url http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2015215/
work_keys_str_mv AT pingzhang hashtrieaspaceefficientmultiplestringmatchingalgorithm
AT yanbingliu hashtrieaspaceefficientmultiplestringmatchingalgorithm
AT jingyu hashtrieaspaceefficientmultiplestringmatchingalgorithm
AT jianlongtan hashtrieaspaceefficientmultiplestringmatchingalgorithm