• DocumentCode
    3342539
  • Title

    Data retrieval from online social network profiles for social engineering applications

  • Author

    Alim, S. ; Abdul-Rahman, R. ; Neagu, D. ; Ridley, M.

  • Author_Institution
    Dept. of Comput., Univ. of Bradford, Bradford, UK
  • fYear
    2009
  • fDate
    9-12 Nov. 2009
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    With the increased use of online social networking sites, data retrieval from social networking profiles is becoming a major tool for business. What makes social networking profile data different is its semi-structured format. The structure and the presentation of profile data change all the time. In social networking there is a lack of research into automated data retrieval from semi-structured Web pages. Our approach is based on automated retrieval of the profile´s attributes and list of top friends from MySpace by examining and extracting the relevant tokens in the parsed HTML code. The tokens were placed into a repository and Breadth First Search algorithm was used. The approach was implemented and tested with a profile which resulted in over 800 top friend profiles and attributes being extracted. This implementation process highlighted that MySpace profile structures vary depending on profile type and the way in which the user has customised the profile.
  • Keywords
    Web sites; commerce; information retrieval; social networking (online); tree searching; MySpace; automated data retrieval; breadth first search algorithm; business; online social networking sites; parsed HTML code; semi-structured Web pages; semi-structured format; social engineering applications; social networking profiles; Computer networks; Data analysis; Data engineering; Data mining; HTML; Information retrieval; MySpace; Social network services; Web pages; Web sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Internet Technology and Secured Transactions, 2009. ICITST 2009. International Conference for
  • Conference_Location
    London
  • Print_ISBN
    978-1-4244-5647-5
  • Type

    conf

  • DOI
    10.1109/ICITST.2009.5402568
  • Filename
    5402568