Finding The Linchpins Of The Dark Web: A Study On Topologically .

1y ago

10 Views

1 Downloads

716.85 KB

15 Pages

Last View : 3d ago

Last Download : 3m ago

Upload by : Mika Lloyd

Report this link

Download PDF

Transcription

2013 IEEE Symposium on Security and PrivacyFinding the Linchpins of the Dark Web: a Study on Topologically Dedicated Hostson Malicious Web InfrastructuresZhou Li, Sumayah AlrwaisIndiana University at Bloomington{lizho, salrwais}@indiana.eduYinglian Xie, Fang YuMSR Silicon Valley{yxie, fangyu}@microsoft.comAbstract—Malicious Web activities continue to be a majorthreat to the safety of online Web users. Despite the plethoraforms of attacks and the diversity of their delivery channels,in the back end, they are all orchestrated through maliciousWeb infrastructures, which enable miscreants to do businesswith each other and utilize others’ resources. Identifying thelinchpins of the dark infrastructures and distinguishing thosevaluable to the adversaries from those disposable are criticalfor gaining an upper hand in the battle against them.In this paper, using nearly 4 million malicious URL pathscrawled from different attack channels, we perform a largescale study on the topological relations among hosts in themalicious Web infrastructure. Our study reveals the existenceof a set of topologically dedicated malicious hosts that playorchestrating roles in malicious activities. They are well connected to other malicious hosts and do not receive trafﬁcfrom legitimate sites. Motivated by their distinctive featuresin topology, we develop a graph-based approach that relieson a small set of known malicious hosts as seeds to detectdedicate malicious hosts in a large scale. Our method isgeneral across the use of different types of seed data, andresults in an expansion rate of over 12 times in detectionwith a low false detection rate of 2%. Many of the detectedhosts operate as redirectors, in particular Trafﬁc DistributionSystems (TDSes) that are long-lived and receive trafﬁc fromnew attack campaigns over time. These TDSes play criticalroles in managing malicious trafﬁc ﬂows. Detecting and takingdown these dedicated malicious hosts can therefore have moreimpact on the malicious Web infrastructures than aiming atshort-lived doorways or exploit sites.misdeeds. Indeed, such infrastructures become the backboneof the crimes in today’s cyberspace, delivering maliciousweb content world wide and causing hundreds of millionsin damage every year.Malicious Web infrastructures. Given the central rolethose infrastructures play, an in-depth understanding of theirstructures and the ways they work becomes critical for counteracting cybercrimes. To this end, prior research investigated the infrastructures associated with some types of channels (e.g., Spam [2], black-hat Search-Engine Optimization(SEO) [11]) and exploits (e.g., drive-by downloads [23]).What have been learnt includes the parties involved ina malicious campaign (e.g., afﬁliates, bot operators [28]),the underground economy behind such campaigns and howthese parties work together [15], [12]. Of particular interestis the discovery of extensive sharing of resources (e.g.,compromised systems, redirection servers, exploit servers)within some categories of malicious activities.With progress being made in this domain, still ourknowledge about the malicious Web Infrastructures is ratherlimited. Particularly, all these prior studies stay at individualredirection chains that deliver malicious content througha speciﬁc channel (e.g., spam [2], twitter [14], malvertising [16]) or lead to a speciﬁc type of illicit activities (e.g.,drive-by downloads, underground pharmaceutical business).What is missing here is an in-depth understanding of the bigpicture: what is the topological view of today’s maliciousWeb infrastructures, and how are malicious entities relatedto each other and the legitimate part of the Web, acrossdifferent redirection chains, different channels, and exploits?The answer to these questions could help us identify thelinchpins of the dark infrastructures, differentiating thosevaluable to the adversary from those expendable. As aresult, we will be able to build more effective and robusttechniques that disrupt malicious activities at their commonweak spots, without knowing their semantics and relying onany channel/attack speciﬁc features such as URL patternsthat often can be easily evaded by the adversary. Also,knowing the topological relations among malicious entities,one can make better use of what has already been detected)to discover other malicious parties.I. I NTRODUCTIONTechnological progress often comes with side effects.Look at today’s Web: not only does it foster a booming Webindustry, but it also provides new opportunities to criminalswho are rapidly industrializing their dark business over theWeb. Today once you unfortunately click a malicious URL,chances are that those who victimize you are no longerindividual, small-time crooks but an underground syndicate:some luring you to visit malicious websites through variouschannels (Spam, tweets, malicious advertising, etc.), somebuying and selling your trafﬁc through redirection, and thereceiving ends of the trafﬁc performing different exploits(drive-by downloads, scams, phishing etc.) on your systemon behalf of their customers. Such a complicated attack isorchestrated through malicious Web infrastructures, whichenable those miscreants to do business with each otherand utilize others’ resources to make money from their1081-6011/13 26.00 2013 IEEEDOI 10.1109/SP.2013.18XiaoFeng WangIndiana University at Bloomingtonxw7@indiana.eduTopologically dedicated malicious hosts. To gain furtherunderstanding of malicious Web infrastructures, we study112

nearly 4 million malicious URL paths crawled from differentfeeds, particularly the topological relations among the hostsinvolved and their connectivity with legitimate Web entities.Our key ﬁnding is the existence of a set of topologicallydedicated malicious hosts that play orchestrating roles in theinfrastructures. From the data we have, all URL paths goingthrough them are conﬁrmed to be malicious. These dedicatedmalicious hosts have a set of distinctive topological features.First, they seem to have strong connections with each otherby forming tight Host-IP Clusters (HICs) that share IP addresses and Whois information. Second, they are extensivelyconnected to other malicious parties, hosting over 70% ofthe malicious paths in our dataset. Finally, they are not foundto receive any legitimate inputs, though they may redirecttrafﬁc to legitimate parties, e.g., when they cloak.by downloads and scam in our data.Our work. Since these topologically dedicated hosts andtheir HICs play a central role in linking different maliciouspaths together, it becomes important to detect them forbreaking the malicious infrastructures. In our research, wecome up with a new topology-based technique designed tocatch these hosts without relying on the semantics of theattacks they are involved in. Intuitively, these dedicated hostsare rather easy to reach from the dark side of the Webwhile extremely hard to reach from the bright side. Thisobservation ﬁts perfectly with the concept of PageRank [3]:that is, they should be popular in the dark world but unpopular outside. Our approach starts with a relatively small setof known malicious HICs as seeds and a large number ofknown legitimate HICs as references, and propagates theirinitial scores across a Web topology using the PageRankalgorithm to compute legitimate and malicious scores forother unknown HICs. In the end, those highly endorsed bythe malicious hosts but way less so by the legitimate onesare identiﬁed as dedicated malicious HICs.Our approach works surprisingly well: in our evaluationbased upon 7-month data crawled from Alexa top websites [1], our approach detects about 5,000 new topologicallydedicated malicious hosts and over 20,000 malicious hostpaths that are not captured by existing solutions, at a falsedetection rate as low as 2%. Our study further reveals theroles, the operation models, and the monetization strategiesof these dedicated malicious hosts, particularly those thatwork as Trafﬁc Distribution Systems (TDSes), which areprofessional trafﬁc buying and selling systems that manageand keep record of trafﬁc-exchange transactions. Our majordetection results and interesting ﬁndings include: New ﬁndings. We conduct the ﬁrst study on topologicallydedicated malicious hosts, and discover their pervasivenessin malicious Web infrastructures and the critical roles theyplay. Our study reveals their topological features and the waythey are utilized. We show that TDSes play an important rolein managing and exchanging trafﬁc ﬂows the adversary usesto deliver malicious content and bring to the light how thesemalicious dedicated hosts evolve with time and how theyare monetized by domain owners through parking serviceseven after their domain names are taken down. For the set of dedicated malicious hosts that serve asTDSes, they are much more long-lived than doorways orexploit sites (65 days vs. 2.5 hours). They receive malicioustrafﬁc from new attack campaigns over time. Disrupting theiroperations has more long-lasting effects than taking downdoorways or exploit sites. Our study shows that even after TDSes are taken down,they continue to receive a large amount of trafﬁc, 10times more than legitimate parked domains. Such trafﬁc isleveraged by domain owners through parking services tocontinue to gain revenues from ad networks.Contributions. The contributions of the paper are summarized as follows: New techniques. We develop a new technique that expandsfrom known malicious seeds to detect other malicious dedicated hosts, based upon their unique features. Our approachworks effectively on large-scale real data, capturing a largenumber of new malicious hosts at a low false detection rate.Roadmap. The rest of the paper is organized as follows:Section II presents the background information of ourresearch, including how data was collected; Section IIIdiscusses a measurement study over the data, which revealsthe important role played by dedicated malicious hosts andtheir prominent features; Section IV describes the designand implementation of our detection technique; Section Vevaluates our detection system on its efﬁcacy; Section VIreports a study on all the malicious dedicated hosts wefound; Section VII discusses a few issues of our studyand potential future work; Section VIII reviews the relatedresearch and Section IX concludes the paper.II. DATA C OLLECTIONIn this section, we explain the data collection process andthe methodology we use to label data for our study. Thisprocess serves two purposes: (1) It helps us prepare data forbuilding the Web topology graph (Section III); (2) It labelsknown malicious and legitimate portions of the Web usingexisting techniques, so that we can study their distinctivetopological features for detection. Later in Section IV, weshow how we can leverage the topological features learnedduring this process to detect malicious URLs and hosts notidentiﬁed before. Our algorithm achieves a high detection rate. Even with asmall set of seed malicious HICs (5% of the labeled ones),we can discover a large number of other malicious HICs,with an expansion rate of 12 times. Our detection algorithm is general across the use ofdifferent malicious seeds, including drive-by downloads andTwitter spam in our experiments. It can also detect malicioushosts set up through different attack channels, such as drive-113

A. Data SourcesFeedStartEnd# Doorway URLsDrive-by-downloadWarningbirdTwitterTop 0121,558,690358,2321,613,9242,040,720triggered by Javascript code, which is not speciﬁed upfrontby any HTTP and HTML ﬁelds. This problem is tackled inprior research [23] by simply searching the script code tolook for the string similar to the URLs involved in the HTTPrequests produced after running the code: if the edit distancebetween the URL in the request and part of the content thescript carries is sufﬁciently small, a causal relation is thoughtto be found and the URL of the document hosting the scriptis connected to the request.A problem for the approach is that it cannot capture a redirection when the adversary obfuscates the JavaScript code,which is common on today’s Web: what has been foundis that increasingly sophisticated obfuscation techniqueshave been employed to evade the detection that inspectsredirections [5]. To mitigate this threat, we resort to dynamicanalysis, instrumenting all JavaScript DOM APIs that canbe used to generate redirections, e.g., document.write.When such an API is invoked, our crawler inspects the callerand callee to connect the URLs of their origins.To increase our chance of hitting malicious websites,we set the crawler’s user-agent to IE-6, which is knownto contain multiple security-critical vulnerabilities. In addition, to avoid some malicious servers from cloaking tothe requests with an empty Referral ﬁeld [7], we setthe Referral ﬁeld of the initial request for each URL tohttp://www.google.com/. After visiting a web page,the crawler also cleans cookies to avoid tracking.Table IDATA FEEDS USED IN THE STUDY.We use four different data feeds to bootstrap data collection. Each data feed includes a set of doorway URLs thatwe leverage to crawl and analyze the redirection topology.Our data feeds include: Drive-by-download feed: Microsoft provides us witharound 30 million doorway URLs that were found to delivermalicious contents (mostly drive-by downloads) and wesample 5% (1.5 million) from them for study. Warningbird feed: We download the malicious URL setposted by the WarningBird project [14]. This set includesover 300k suspicious URLs in Twitter spam. Twitter feed: We run Twitter Search APIs [29] to pick top10 trending terms every day. We use these terms to collectrelated tweets and extract all the URLs they contain. Thisprocess gives us 1.6 million URLs. Top-site feed: We gather Alexa top 1 million web sitesand update them every week. We obtain 2 million URLs intotal, most of which are legitimate.All together, we have gathered about 5.5 million initialURLs, which serve as inputs to a set of crawlers describedbelow during a 7-month period to collect data.C. Data LabelingFor each visit, our crawler generates a set of URLs andconnects them according to their causal relations, whichgives us a set of URL paths in terms of URL redirectionchains. From URL paths, we further derive a set of hostpaths that keep only host names along the redirection chains.We proceed to label all the crawled URLs, URL paths, andhost paths as malicious or legitimate using a combination ofexisting tools and methods.B. Redirection Chain CrawlingWe deploy 20 crawlers, each hosted on a separate Linuxvirtual machine with a distinctive IP address, to explorethe URL redirection paths of these 5.5 million doorwayURLs. Each crawler is built as a Firefox add-on, whichkeeps track of all the network requests, responses, browserevents and page content it encounters in a visit. Based onsuch information, our approach automatically reconstructs aredirection URL path for the visit, which links all relatedURLs together.More speciﬁcally, such URL paths are built in a mostlystandard way, similar to Google’s approach [23] exceptthe part for analyzing Javascript. Our approach detectsredirections from HTTP status code (e.g. 302), Meta refreshtag, HTML code (e.g., iframe tag) and JavaScript. The dots(URLs) here are connected using different techniques underthese different types of redirections. Actually, Firefox givesaway the source and destination URLs through browserevents when the redirection has HTTP 3xx status code or istriggered by Meta refresh, which allows us to link the sourceto the destination. For those caused by HTML, we can ﬁndout the URL relation according to the Referral ﬁeld of thedestination URL. What gives us trouble is the redirectionForefront Alarmed PageLegitimate Ad-networkIframe1Doubleclick.comMalicious RedirectorIframe2pokosa.comExploit Servergoodp.osa.plFigure 1. Redirections from a Forefront alarmed Page. The ﬁrst redirectionpath is not marked as malicious since it leads to only a legitimate party. Thesecond redirection path is a malicious path as the redirection is generatedfrom an iframe injected by an attacker.Labeling of malicious URLs and paths. Speciﬁcally, weﬁrst use Microsoft Forefront, an Anti-Virus scanner, to scanthe contents associated with individual URLs encountered114

it does is to provide a set of URLs and paths that areconﬁrmed malicious or legitimate based on existing tools(e.g., Forefront, whitelists). The rest of the URLs (78.51%)are treated as unknown and our goal is to come up with amethodology for automatically detecting malicious partiesfrom them.during the crawling.1 Once a node (i.e., a URL) is ﬂaggedby the scanner as containing malicious contents (typicallycode), the URL is labeled as malicious. The data crawledfrom Alexa top sites and Twitter feeds yield mostly legitimate URLs. The data crawled from the drive-by downloadand the Warningbird feeds, however, do not always yieldmalicious URLs for each visit. The reason is that drive-bydownload doorway URLs are sometimes hosted on compromised hosts, which may have already been cleaned up whenwe visit them. Therefore, the scan we perform helps avoidfalsely marking them as malicious.Once we label a URL as malicious, we treat all the URLpaths going through it as suspicious paths. However, notall suspicious paths are malicious. For example, a maliciousdoorway page may lead to multiple paths, and only oneof them leads to exploits. This happens when a maliciousdoorway page contains multiple URLs, some of whichare legitimate and redirect to other legitimate sites (e.g.,doubleclick), as illustrated in Figure 1. To avoid markingthem as malicious, we further inspect whether there existsanother non-doorway URL on a suspicious path also markedas malicious. If so, we label the corresponding path as amalicious path. For the paths whose doorway pages directlycontain exploit code, we label these paths as maliciouswithout the need of examining other URLs. If all the URLpaths corresponding to a host path are labeled as malicious,we label the host path as malicious as well.Drive-by-downloadWarningBirdTwitterTop 21250,6271,079,9036,8713,10023,2191,111,104III. T OPOLOGY- BASED M EASUREMENTSIn this section, we study the properties of malicious URLsand host paths. We focus on examining the topologies andthe connections of malicious and legitimate entities. Ourmeasurements reveal the existence of a set of topologicallydedicated hosts that play critical roles in malicious activities.The unique properties of these hosts inspire us to developa graph-based detection approach, which can capture thesehosts without any information about their semantics, e.g.,the content they accommodate or the code they run.A. Hostname-IP Cluster (HIC) ConstructionTo study Web entity topologies, one natural way is toexamine individual URLs or hostnames. However, priorresearch shows that attackers often register hundreds ofmalicious hostnames, all pointing to a small set of IPaddresses under one domain registrar [19]. Once a hostnameis detected, attackers can quickly switch to another one inthe pool. From the topology perspective, the signals of suchshort-lived individual URLs or hostnames may not be strongenough to distinguish them.Instead, we explore the groups of URLs or hostnamesthat are controlled by the same attackers. For this purpose,we construct Hostname-IP Clusters (HICs) that capturethe intrinsic sharing relations between hostnames and IPaddresses. The concept of HICs has been used in priorresearch [33] to detect servers that play central roles indrive-by download campaigns. A problem of their deﬁnition is that it is solely based upon the relations betweenIPs and hostnames, which does not work well on today’sWeb, where attackers increasingly utilize hosting or cloudservices. When this happens, all the hosts running on a cloudserver will be clustered together.Our solution is to use the Whois information [31] to guidethis clustering process: two hosts sharing IPs are consideredto be related only if their domain names are from the sameregistrar. Since malicious hosts strongly prefer low-cost, lesswell known registrars (see Section III-C), this treatmentturns out to be very effective. More precisely, our HICconstruction process is as follows:I We assign a unique HIC instance to every hostname.II We start to merge these HICs in a similar way to thatin prior work [33]. The construction process iterativelyinspects every pair of HICs. We ﬁrst compute theoverlapping of their IPs. Let IP S1 be the IP set forHIC H1 , and IP S2 be that of HIC H2 . H1 and H2are considered to be merged if the Jaccard distanceIP S1 IP S2IP S1 IP S2 is larger than a threshold TIP S . SimilarTable IIDATA STATISTICS AFTER LABELING .Labeling of legitimate URLs. We also label the remainingURLs that correspond to reputable domains or known adservices as legitimate URLs. To do so, we ﬁrst cluster thenon-malicious URLs based on their domains and manuallyexamine the URL clusters with over 1,000 URLs each.Among these clusters, we identify 19 reputable ones, such asgoogle.com and facebook.com, and we use them tolabel legitimate URLs. In addition, we use EasyList [21] andEasyPrivacy [22] to identify ad-networks and trackers. Thesetwo lists are also utilized by the popular browser pluginAdblock plus [20] to block ads and tracking scripts. Finally,since URL shorteners (e.g., t.co) are extensively used byTwitter users to embed URLs in Tweets, we also identifythem using a known list compiled for this purpose [18].Of course, this labeling process is not exhaustive. All1 We do not use Google Safebrowsing because a reported malicious URLmay be hosted on a compromised site and already be cleaned by the timeof our crawl.115

connectivity, we build an HIC graph by linking two HICswith a directed edge if there is an URL redirection betweentheir hosts. In total, we have 1,951,313 HIC nodes and9,058,597 edges on the HIC graph.Closely examining these dedicated malicious HICs, weﬁnd that they are highly intertwined: among 15,273 dedicated malicious HICs, 12,942 (84.74%) are located on afully connected subgraph. The dedicated malicious HICs arealso intensely connected with other non-dedicated maliciousHICs: 80.40% of non-dedicated malicious HICs are directlyor indirectly connected to at least one dedicated HIC. Thisobservation indicates that the dedicated malicious HICs arequite easy to reach from the “dark” world. Starting from afew malicious URLs and following their redirect chains, youmay easily reach some dedicated malicious HICs.to [33], we set this threshold to 0.5, to accommodate theIP variations caused by content-distribution networks(CDN) and fast-ﬂuxing [10]. Besides this criterion, wetake an additional step to check their Whois information. Only if their registrars are also identical can wemerge them together. The above process iterates untilno HIC pairs can further merge.Figure 2 illustrates this process. HIC1 and HIC2 can bemerged since their IP address overlapping is 60% and theyhave the same registrar. HIC3 is not merged with any otherHICs because its registrar is different from others.Registrar1Hostname1 IP4IP3HIC3Hostname3Fraction of 2HIC2Figure 2.HIC generation process.0.80.6Dedicated HIC0.4Non-dedicated HIC0.20024681012# Link-in Legitimate HICB. Topologically Dedicated Malicious HICsAll together, we obtain 1,951,313 HICs using the abovemethod from our data. Among them, 15,273 are found toonly host conﬁrmed malicious URL paths (and the corresponding host paths) in our datasets (collected over a 7month period). This topological property differentiates themfrom other HICs, which contain at least one URL path thatwe cannot conﬁrm. We call the former dedicated maliciousHICs and the latter non-dedicated malicious HICs.These dedicated HICs apparently play a critical role inthe malicious activities: they are attached to 76.2% of themalicious paths across all the data sources in Table II. Although we have no ground truth about whether the dedicatedmalicious HICs are indeed set up by malicious parties, weﬁnd that their hostnames usually exhibit patterns of domainrotations and that they are often registered under unpopulardomain registrars2 . Table III lists the top 10 (ranked by thenumber of paths going through) dedicated malicious HICsin our datasets. Such observations suggest that these HICsmay correspond to dedicated hosts that are set up for justmalicious uses, e.g., “central servers” for drive-by downloadcampaigns [33].Figure 3.CDF of the number of Legitimate Link-in HIC betweenDedicated HICs and Non-dedicated HICsFraction of HICs10.80.6Dedicated HIC0.4Non-dedicated HIC0.20024681012# Link-out Legitimate HICFigure 4.CDF of the number of Legitimate Link-out HIC betweenDedicated HICs and Non-dedicated HICsIn contrast, these dedicated malicious HICs rarely receivetrafﬁc from legitimate or unknown parties (labeled by themethodology in Section II-C), even when these legitimateparties do appear on malicious paths. In terms of such a“link-in” relation, the dedicated malicious HICs are moreremote to legitimate parties than non-dedicated maliciousHICs. Figure 3 shows that 97.75% of the dedicated maliciousHICs do not receive any trafﬁc redirections from legitimateHICs. For the rest 2.25% of dedicated malicious HICs thatdo, they mostly correspond to malicious entities that haveC. Graph Properties of Dedicated Malicious HICsWhen we examine the inter-connections among HICs, weﬁnd that these dedicated HICs are not isolated. Instead,they tend to connect to each other. To understand their2 According to [6], the ﬁve best domain providers are NameCheap, 1&1,Go Daddy, Name and Gandi.116

u, vznrahwzgntmfcqk.ru, .viagrabuytoday.com, buycialistodoors.com, .searchstr.com, ssrsearch.comsqwlonyduvpowdgy.ru, qlihxnncwioxkdls.ru, .tadalaﬁl-mastercard.ordercialisonlineus.com, viagra-brand-viking.cialisshopsale.com, .soxfurspwauosdis.ru, iqsxbaoyzweerppq.ru, .freshtds.eupuvbgoizrqsxsxzq.ru, fkffqgkqfqdxekvq.ru, .michaelmazur.netNAUNET-REG-RIPNINTERNET.BS CORPINTERNET.BS CORPNAUNET-REG-RIPNINTERNET.BS CORPNAUNET-REG-RIPNPDR Ltd.NAUNET-REG-RIPNTUCOWS.COM CO.Table IIIT OP RANKED HIC Sdrive-by downloads) of speciﬁc types of attacks or speciﬁcdata sources for detection, our approach utilizes only thetopological information of malicious Web infrastructures.An important advantage of this approach is that it workson different types of attacks and different sources of data,regardless whether the attack is drive-by download, scam,or is carried through spam tweets [14] or malvertising [16],as long as it exhibits the same topological properties usedfor detection, which in our case is the connectivity of dedicated malicious HICs. Moreover, such an approach can bemore difﬁcult to evade by active adversaries: the dedicatedmalicious HICs could cloak to the crawlers, redirectingtrafﬁc to google.com, but they cannot easily change theirconnection structures to receive more trafﬁc from legitimatehosts or less trafﬁc from other malicious hosts.inﬁltrated legitimate ad networks (e.g., Doubleclick) andreceive trafﬁc from them [16]. By comparison, 25.70% ofnon-dedicated malicious HICs receive trafﬁc redirectionsfrom other legitimate HICs. This observation shows thatcompared to legitimate or non-dedicated malicious HICs,the topologically dedicated malicious HICs are much harderto reach from the bright side of the Web.In terms of the “link-out” relations, dedicated maliciousHICs are less likely to redirect trafﬁc to legitimate HICs.This usually happens when those malicious parties cloak.Figure 4 shows that 28.30% of the dedicated malicious HICsredirect their visitors to legitimate hosts, compared with61.53% of non-dedicated malicious HICs that do the same.The graph properties of these dedicated malicious HICsshow that they are well connected and easy to reach fromknown malicious URLs, but they are much harder to getto from legitimate ones. This observation provides strongimplications for developing the right technique to detectthem. Particularly, the well-known PageRank algorithm ﬁtswell with such topological properties, and therefore we adoptit in our research to detect those hosts without relyingon their semantic information. Note that what we focuson here is dedicated malicious HICs. Those non-dedicated,particularly compromised hosts, may not have such graphproperties. As a result, the PageRank approach may not beapplicable to ﬁnd them. In the next section, we explain thethis detection method in detail.A. PageRank-based DetectionThe connectivity features of the dedicated malicious HICsare well captured by the concept of PageRank [3], a technique widely used to evaluate the importance of web pages.In the web site ranking scenario, a web page is consideredto be important and therefore has a high rank if it is wellconnected, easy to reach from other (randomly-selected)pages. This rank is computed by propagating the initial scoreof each web page across a directed hyperlink graph anditeratively updating the page’s score based on the ranks ofthe pages that link to it. This idea has also been appliedto detect web spam pages [9], comment spams [32] andspammers on social graphs [4].In our case, what makes the dedicated malicious HICsunique is their unbalanced connections from (dedicated ornon-dedicated) malicious HICs v.s. those from legitimateones. Using PageRank as the yardstick, malicious HICs gethigh ranks from the dark Web and low ranks from the brightside of the Web. Therefore, our idea is to compute tw

scale study on the topological relations among hosts in the malicious Web infrastructure. Our study reveals the existence of a set of topologically dedicated malicious hosts that play orchestrating roles in malicious activities. They are well con-nected to other malicious hosts and do not receive trafﬁc from legitimate sites.

Related Documents:

Nonprofit Self-Assessment Checklist

May 02, 2018 · D. Program Evaluation ͟The organization has provided a description of the framework for how each program will be evaluated. The framework should include all the elements below: ͟The evaluation methods are cost-effective for the organization ͟Quantitative and qualitative data is being collected (at Basics tier, data collection must have begun)

1.4K Views

2y ago

Name of thé élément in thé language and script of thé ... - UNESCO

Silat is a combative art of self-defense and survival rooted from Matay archipelago. It was traced at thé early of Langkasuka Kingdom (2nd century CE) till thé reign of Melaka (Malaysia) Sultanate era (13th century). Silat has now evolved to become part of social culture and tradition with thé appearance of a fine physical and spiritual .

117 Views

9m ago

[Kl - Mauritius

On an exceptional basis, Member States may request UNESCO to provide thé candidates with access to thé platform so they can complète thé form by themselves. Thèse requests must be addressed to esd rize unesco. or by 15 A ril 2021 UNESCO will provide thé nomineewith accessto thé platform via their émail address.

470 Views

1y ago

Employee Benefits Event - Schneider Downs Tax Services

̶The leading indicator of employee engagement is based on the quality of the relationship between employee and supervisor Empower your managers! ̶Help them understand the impact on the organization ̶Share important changes, plan options, tasks, and deadlines ̶Provide key messages and talking points ̶Prepare them to answer employee questions

329 Views

1y ago

Study Investigating thè Effect of E- Service Quality on Customer's ...

Dr. Sunita Bharatwal** Dr. Pawan Garga*** Abstract Customer satisfaction is derived from thè functionalities and values, a product or Service can provide. The current study aims to segregate thè dimensions of ordine Service quality and gather insights on its impact on web shopping. The trends of purchases have

127 Views

9m ago

Kinh Giải Thâm Mật HT. Thích Trí Quang dịch giải

Chính Văn.- Còn đức Thế tôn thì tuệ giác cực kỳ trong sạch 8: hiện hành bất nhị 9, đạt đến vô tướng 10, đứng vào chỗ đứng của các đức Thế tôn 11, thể hiện tính bình đẳng của các Ngài, đến chỗ không còn chướng ngại 12, giáo pháp không thể khuynh đảo, tâm thức không bị cản trở, cái được

1.6K Views

3y ago

1 REFERENCES GENERALES 2 - bourre

Le genou de Lucy. Odile Jacob. 1999. Coppens Y. Pré-textes. L’homme préhistorique en morceaux. Eds Odile Jacob. 2011. Costentin J., Delaveau P. Café, thé, chocolat, les bons effets sur le cerveau et pour le corps. Editions Odile Jacob. 2010. Crawford M., Marsh D. The driving force : food in human evolution and the future.

988 Views

3y ago

jean-marie-bourre bienvenue

Le genou de Lucy. Odile Jacob. 1999. Coppens Y. Pré-textes. L’homme préhistorique en morceaux. Eds Odile Jacob. 2011. Costentin J., Delaveau P. Café, thé, chocolat, les bons effets sur le cerveau et pour le corps. Editions Odile Jacob. 2010. 3 Crawford M., Marsh D. The driving force : food in human evolution and the future.

1.0K Views

3y ago

Recent Views

PHONE NO. CONTACT TOPIC/SUBTOPIC ORGANIZATION #A

651-757-2762 Deborah Klooz MPCA Paralegal: 651-757-2631 Jean Coleman MPCA Staff Attorney: 651-757-2791 Adonis Neblett MPCA Staff Attorney: 651-757-2017 Carmen Netten MPCA Staff Attorney: 651-757-2759 David Stellmach MPCA Staff Attorney: 651-757-2247 Joseph Dammel MPCA Staff Attorney: 651-757-2545 Michelle Janson MPCA Staff Attorney: #ATTORNEY .

2y ago

403 Views

Local Prosecutors and The Attorney General

Attorney General of Iowa Other Members iii Honorable Arthur K. Bolton Attorney General of Georgia Honorable Chauncey H. Browning, J 1'. Honorable John C. Danforth Attorney General of Missouri Honorable J olm P. Moore Attorney General of Colorado Attorney General of West Virginia Honorable Larry Derryberry Attorney General of Oklahoma

1y ago

178 Views

30th Annual Anti-Fraud Conference Tentative Schedule

Apr 30, 2019 · Jill Nerone, Supervising Deputy District Attorney, Alameda County District Attorney’s Office Laura Meyers, Assistant District Attorney, San Francisco County District Attorney’s, Office Nicole Pantaleo, Deputy District Attorney, Marin County District Attorney’s Office, Insurance F

2y ago

150 Views

Shannon McClellan Hon. Diane O. Leasure Ellery M. “Rick .

Attorney at Law Hon. Pamila J. Brown BOG Liaison District Court, Howard County Alan S. Carmel Attorney at Law Sarah Dawn Cline Attorney at Law Adam Sean Cohen Attorney at Law Delegate Kathleen M. Dumais District 15 Suzanne K. Farace Attorney at Law Barry L. Gogel Attorney at Law Michael I. Gordon

2y ago

142 Views

Powers of Attorney Act 2003 A Commentary - Law Society of New South Wales

POWERS OF ATTORNEY ACT 2003: A COMMENTARY 6 POWERS OF ATTORNEY ACT 2003: COMMENTARY The commentary is provided in black text. Reference to the "Act" is a reference to the Powers of Attorney Act 2003 as amended. Reference to the "Regulation" is a reference to the Powers of Attorney Regulation 2011, recently amended by the Powers of Attorney Amendment Act 2013 and the Powers of

7m ago

94 Views

California Safe Drinking Water and Toxic Enforcement Act .

District Attorney of Madera County 209 West Yosemite Avenue Madera, CA 93637 District Attorney of Marin County 3501 Civic Center Drive, Rm. 130 San Rafael, CA 94903 District Attorney of Mariposa County P.O. Box 730 Mariposa, CA 95338 District Attorney of Mendocino County P.O. Box 1000 Ukiah, CA 95482 District Attorney of Merced County

3y ago

163 Views

IN THE UNITED STATES COURT OF APPEALS FOR THE FIRST

Mar 06, 2020 · Attorney General of New Jersey Assistant Attorney General Counsel of Record Attorney for Amicus Curiae JOHN T. PASSANTE State of New Jersey Deputy Attorney General New Jersey Attorney General’s Office Richard J. Hughes Justice Complex 25 Market Street Trenton, NJ 086

2y ago

128 Views

ATTORNEY HANDBOOK - United States Courts

e. Each attorney's or pro se litigant's name must be typed and signed on the last page of the complaint, with: (1) his/her address (2) telephone number (3) if a Pennsylvania attorney, his/her Pennsylvania Attorney ID Number f. To file a complaint, the attorney must have an electronic signature on the complaint and must have an electronic

1y ago

124 Views

Power of Attorney - FedEx

Show the date the Power of Attorney is signed. Corporation Power of Attorney Partnership 1 10 9 8 7 6 5 4 3 2 12 11 1 10 9 8 7 6 5 4 3 2 12 11 1 10 9 8 7 6 5 4 3 2 12 11 Rev 6/13 The number preceding each instruction corresponds to the same number on the example of the power of attorney form. Customs Power of Attorney, Designation as Export .

1y ago

157 Views

Powers of Attorney - Ontario

attorney, a family member or friend may have to apply to be appointed as guardian. Powers of attorney that were properly made under previous laws of Ontario remain legally valid. The forms for a Continuing Power of Attorney for Property and a Power of Attorney for Personal Care contained in this booklet were revised on March 29, 1996 in accordance

1y ago

155 Views

STATUTORY POWER OF ATTORNEY - eForms

repudiated the power of attorney; and the power of attorney still is in full force and effect. 5. I/we make this affidavit for the purpose of inducing _ to accept delivery of the above described instrument, as executed by me/us in my/our capacity of attorney(s)-in-fact for the Principal. _, Attorney-in-fact

1y ago

118 Views

John J. Hoffman Acting Attorney General of New Jersey

JOHN J. HOFFMAN ACTING ATTORNEY GENERAL OF NEW JERSEY Division of Law 124 Halsey Street — 5th Floor P.O. Box 45029 Newark, New Jersey 07101 Attorney for Plaintiffs By: Jah-Juin Ho - #033032007 Deputy Attorney General 973-648-2500 JOHN J. HOFFMAN, Acting Attorney General of the State of New Jersey, and ERIC T.

1y ago

89 Views

Options in Oregon to Help Another Person Make Decisions

Power of Attorney A “Power of Attorney” is a legal document that allows a person to give another person (called an “agent”) the right to act on the person’s behalf. A “Power of Attorney” in Oregon can only be used for financial decisions. The way a “Power of Attorney” is written is important. The authority given to the agent can

3y ago

134 Views

- fcdfa

FRESNO COUNTY SUPERIOR COURT By DEPT.402 JAN SCULLY District Attorney, County of Sacramento RUTH YOUNG, State Bar No. 133606 Deputy District Attorney 906 G Street, Suite 700 Sacramento, CA 95814 Telephone: (916) 874-6174 JACKIE LACEY District Attorney, County of Los Angeles STUART C. LYTTON, State Bar No. 114241 Deputy District Attorney

3y ago

136 Views

Non-Attorney E-File Registration

your motion for e-filing access. Instructions to submit the Non-Attorney E-File Registration: 1. Register for a Non-Attorney Filer Account on the PACER website at www.pacer.uscourts.gov. If you already have a PACER Account, login to Manage My Account, select Non-Attorney E-File Re

2y ago

181 Views

Finding The Linchpins Of The Dark Web: A Study On Topologically .

It looks like you're using an ad-blocker