Techniki zwinne w pracy analityka policyjnego
Transcription
Techniki zwinne w pracy analityka policyjnego
How can effects of R&D activities support actions in the areas of criminal intelligence analysis and computer forensics? Marek Kisiel-Dorohinicki, Jacek Dajda University of Science and Technology Department of Computer Science Forensic Software Laboratory www.fslab.agh.edu.pl Agenda • Who are we? • Our solutions • What is important to succeed? Who are we? Our team • Our team origins from Department of Computer Science at University of Science and Technology in Kraków • We soon understood that the strong team needs to be diversified Scientist (e.g. employees) Young researches (e.g. PhD students) Developers (e.g. former students) Domain experts (e.g. end-users) Technological experts (e.g. freelancers) What we do • From 2006 we specialize in development of tools for homeland security forces, such as Police, Border Guard, Public Prosecutors Office, Customs Service, Government Protection Bureau, Military Gendarmiere • We participated or participate (often as a leader) in 9 research & development projects, majority of which funded by National Centre of Research and Development • The results of these projects are working software products, used by our end-users or ready for deployment at customers sites Distribution of our solutions www.fslab.agh.edu.pl Our users • Currently over 1200 registered users • More than 1000 downloads of single version of our most popular product LINK • Over 40 signed licences for our products Our solutions Selected solutions Analytical databases FileSeeker LINK WebAlert Geospatial analysis LINK platform • Extensible platform for data processing, analysis and visualization • Heterogeneous sources of information and various formats and standards of the data • Data integration based on pre-defined and userdefined models (types of analytical objects and relations) • The use of pre-defined models allows for domainoriented processing Importing structural data • Mapping of data from tables (eg. spreadsheets, databases) into domain model • Automated discovery of formats • Data dalidation Data integration on diagrams • Visualization in form of interactive graph diagrams • Ability to connect data from different data sources • Identification of duplicated entries, data cleaning Data visualization using geographical maps • Handling objects and events which have geographical coordinates • Multi data series • Graphical editor which enables user to prepare final presentation • Graphical time filters • Ability to work offline Data visualization on a timeline • Handling any events with a time stamp • Multiple object lines • Graphical editor Focused statistics • User friendly filtering and sorting • Statistics related to time of event or relationships between objects • Identification of statistical anomalies Analysis of email traffic • Support for various formats of inboxes • Visualization in forms of tables and graphs • Ability to track the flow of attachments and communication between email addresses Analysis of bank accounts statements • Convenient import into predefined data model • Embedded configurable algorithms for pattern discovery Selected solutions Analytical databases FileSeeker LINK WebAlert Geospatial analysis Software for computer forensic Supported sources: • Typical disk image formats (RAW, DD, EnCase, AFF, AFD, AFM, mounted files and folders) • Partition formats: NTFS, FAT, UFS 1, UFS 2, EXT2FS, EXT3FS, EXT4, HFS, ISO 9660 • Support for unallocated disk space • Support for deleted files (Recycle Bin) Most important file formats • • • • MS Office, OpenOffice documents PDF documents Plain text files (encoding detection) Archives – 15 types (zip, 7z, rar, tar, gz, …) • Multimedia files (audio, video, graphics) • Email (e.g. MS Outlook, Thunderbird), • Web browsers cache and history Searching with the hit rate and context NLP algorithms Searching handles: • different forms of words • Typos • Can be extended with dedicated extensions (e.g. Polish language) Automated extraction of objects Supported object types: • addresses, • emails, • urls, phone numbers, • bank accounts, • passwords, • name of companies, • names and surnames Selected solutions Analytical databases FileSeeker LINK WebAlert Geospatial analysis Data collecting, searching and sharing Documents, reports Analytical objects, connections, cases Analysts Operator Advanced searching Administrator Support for documents processing • Documents registration and tagging • Attachments and comments management • Objects and connections extraction Named Entity Recognition • Automated discovery of analytical objects with NLP techniques (mainly Named Entity Recognition) • Support for entity linking and building relations between entities Full text searching • Based on NLP techniques • Searching through all data in the system in all attributes and fields • Highlighting precise hits and similar hits • Searching for connected objects Data visualization Graph of connections Analytical objects profiles Lists, tables, trees Advanced data model • Object-based • Easily configurable and extensible • Versioned Security • User roles and groups with hierarchies • Document-based permission management • Action log per every user Selected solutions Analytical databases FileSeeker LINK WebAlert Geospatial analysis GIS Tools • An extension to LINK platform • Offers dedicated GIS functions supporting specific cases related to geospatial analysis • Can be easily extended with new functions Interesting Points Of Interests • The idea is to quickly find POI of specific kind which are interesting for the current case • Example: finding CCTV cameras near the crime scene Identification of meeting places • Based on a time stamp and geographical location it is possible to identify potential meeting places Identification of similar tracks • Similar to places but extended for a series of points • Can be used to verify acquaintance of two persons based on their phone numbers billings • Can be used to compare tracks with a given pattern Selected solutions Analytical databases FileSeeker LINK WebAlert Geospatial analysis The tool concept HTTP / JS Searching and montoring web content • Searching for specific domains or starting from specific web pages • Configuration options: depth of searching, maximum number of pages, etc. • Advances language rules: different forms of words, searching for similar words, searching in neighbourhood • User dictionaries Support for secured and dynamic sources • Ability to macros (e.g. login into Facebook) • Executing JavaScripts • Automated clicking for web pages elements (e.g. Show more, Show comnments) • Automated scrolling for new content Data extraction based on templates • Extraction of important elements to a structured form: CSV, XLSX for further analysis • Example for an internet forum: author, content and data of a post • Can be very useful for internet auction portals Final thoughts What is import to succeed? • Open-mindness, flexibility (e.g. agile methods and techniques) • Close collaboration, team work, diversity • Proper focus on final product and goals • Balance between innovation and working software • Support, availbility, maintenance