DocumentCode :
594629
Title :
A semantics aware approach to automated reverse engineering unknown protocols
Author :
Yipeng Wang ; Xiaochun Yun ; Shafiq, M. Zubair ; Liyan Wang ; Liu, Alex X. ; Zhibin Zhang ; Danfeng Yao ; Yongzheng Zhang ; Li Guo
Author_Institution :
Inst. of Comput. Technol., Beijing, China
fYear :
2012
fDate :
Oct. 30 2012-Nov. 2 2012
Firstpage :
1
Lastpage :
10
Abstract :
Extracting the protocol message format specifications of unknown applications from network traces is important for a variety of applications such as application protocol parsing, vulnerability discovery, and system integration. In this paper, we propose ProDecoder, a network trace based protocol message format inference system that exploits the semantics of protocol messages without the executable code of application protocols. ProDecoder is based on the key insight that the n-grams of protocol traces exhibit highly skewed frequency distribution that can be leveraged for accurate protocol message format inference. In ProDecoder, we first discover the latent relationship among n-grams by first grouping protocol messages with the same semantics and then inferring message formats by keyword based clustering and cluster sequence alignment. We implemented and evaluated ProDecoder to infer message format specifications of SMB (a binary protocol) and SMTP (a textual protocol). Our experimental results show that ProDecoder accurately parses and infers SMB protocol with 100% precision and recall. For SMTP, ProDecoder achieves approximately 95% precision and recall.
Keywords :
formal specification; inference mechanisms; pattern clustering; protocols; reverse engineering; ProDecoder; SMB; SMTP; application protocol parsing; automated reverse engineering; binary protocol; cluster sequence alignment; keyword based clustering; network traces; protocol message format inference system; protocol message format specifications; semantics aware approach; system integration; textual protocol; vulnerability discovery; Electronic mail; Natural language processing; Postal services; Protocols; Reverse engineering; Semantics; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Network Protocols (ICNP), 2012 20th IEEE International Conference on
Conference_Location :
Austin, TX
Print_ISBN :
978-1-4673-2445-8
Electronic_ISBN :
978-1-4673-2446-5
Type :
conf
DOI :
10.1109/ICNP.2012.6459963
Filename :
6459963
Link To Document :
بازگشت