Title :
Regular expressions on the web
Author :
Hodován, Renáta ; Herczeg, Zoltán ; Kiss, Ákos
Author_Institution :
Dept. of Software Eng., Univ. of Szeged, Szeged, Hungary
Abstract :
On the web, static pages fade into the past; web sites use server and client-side scripting techniques to improve the user experience. For client-side scripting, JavaScript is by far the most commonly used dynamic language. In these scripts, regular expressions are widely used for several purposes, e.g., for text filtering and form validation. In this paper, we take a closer look at regular expressions on the web. We investigate historical data and determine the trends of the use of regular expressions over time on various web sites. Then, we also take a snapshot of the current status. We find that although regular expressions are being used more and more often, their use is highly repetitive: on the most popular web sites only 4% of the regular expressions are unique. Based on this result, we show that web browsers can cut down regular expression compilation time to the third by applying caching techniques.
Keywords :
Java; Web sites; client-server systems; JavaScript; Web browsers; Web sites; caching techniques; dynamic language; regular expressions; server-client side scripting techniques; Automata; Browsers; Communities; Engines; Internet; Web pages; Computer languages; history; pattern matching;
Conference_Titel :
Web Systems Evolution (WSE), 2010 12th IEEE International Symposium on
Conference_Location :
Timisoara
Print_ISBN :
978-1-4244-8638-0
DOI :
10.1109/WSE.2010.5623572