Protecting Databases From Inference Attacks*

3y ago

33 Views

3 Downloads

3.71 MB

22 Pages

Last View : 22d ago

Last Download : 3m ago

Upload by : Melina Bettis

Report this link

Download PDF

Transcription

Computers & Securiry Vol. 16, No. 8, pp.687-708, 19970 1997 Else&r Science LimitedAll rights reserved. Printed in Great Britain0167-4048197 17.00ELSEVIERProtectingdatabases frominferenceattacks*Thomas H. Hinke, Harry S.Delugach and Randall F?WolfComputer Science Department, The University ofAlabama in Huntsville,Huntsville, AL 35899, Phone: (205) 895-6455 FAX: (205) 8956239, E-mail: thinke@xuah.edu,delugach@xuah.eduThis paper presents a model of database inference and a taxonomyof inference detection approaches.TheMerlin inference detectionsystem is presented as an example of an automated inference analysis tool that can assess inference vulnerabilities using the schemaof a relational database. A manual inference penetration approachis then offered as a means of detectinginstances of data or characteristicsinferences that involveof groups of instances. Theseprise. Using available secure database management systems, an enterprise has the ability to provide variousdegrees of protection for the data.This protection canrange from access lists to label-based protection, wheresecurity labels are assigned to the data based on its sensitivity. Access to this data is mediated based on theprivileges of those who attempt to access it.two approaches are offered as practical approaches that can beapplied today to address the database inference problem.Thefinalsection discusses future directions in database inference research.Keywords:computer security, database inference, database security,inference detection tools, inference detection analysis1. IntroductionA database holds a great amount of data that is criticalfor the operation of enterprises, be they commercialor government.Thisdata, while providing crucial support for the mission of the enterprise, can also providea source of sensitive information that is useful forthose who are competitors or adversaries of the enterlThis work was supported under Maryland Procurement 05ceNo. h4DA904-94-C-6120.ContractUnfortunately, properly protecting individual portionsof the database may not provide complete protection.A competitor or adversary may be able to use data thatin isolation appears to be properly protected to inferdata that is highly sensitive.The problem for the enterprise is to discover these inferences, and then to takenecessary countermeasures to close them.The general solution to the inference problem is difficult, since an adversary can apply a deep body ofknowledge in performing an inference attack. Anyadversary must be assumed to possess an extensiveeducational background, as well as familiarity withthe specific domain of knowledge of his intendedattack. AU of this knowledge can be applied in performing the inference attack. The implication of this687

TH. Hinke et al. / Protecting Databases From Inference Attacksis that the protectors of the database must also applythis deep knowledge to their inference analysis inorder to discover the vulnerabilities of their databasebefore they are discovered by their adversary. Whilework is proceedingto address database inferencedetection in light of the deep knowledge required[l], the results of this work are still in the researchphase. However, the fact that deep knowledge isrequired to address the general inference problemfully does not mean that there are no practical techniques available today to apply to important segments of the problem. It is the objective of this paperto describe these practical techniques.In order to provide both a context for this discussionand a means to help those with responsibility fordatabase protection to understand some of the subtleimplications of the inference problem, Section 2 ofthis paper will present a model of the inference problem that has been developed as part of the AERIEinference project at the University of Alabama inHuntsville, USA.This model provides a useful meansof visualizing the various data vulnerabilities associated with the inference problem, as well as terminology for addressing methods for countering the problem. Section 3 will then consider a framework forcategorizing the techniques that can be applied tothe design of databases in order to reduce their vulnerability to inference attack. Section 4 will discusscurrent state of the art automated inference techniques, and Section 5 will present manual techniquesthat can be applied. Finally, Section 6 will contain abrief discussion of future techniques that are still inthe research stage.2. Characterization of inferenceVulnerabilities in DatabasesThe AERIEinferenceresearchprojectat theUniversity of Alabama in Huntsville has developed amodel of the inference problem that is called AERIE(Activities, Entity, RelationshipsInference Effects).This model assumes that an adversary desires certaindata that is the target of his or her inference attack.This target, which is referred to as the sensitive target,can be expressed in terms of the constructs of theAERIE model. This model augments the entity-rela-tionship (ER-model) developed by Chen that is commonly used for database modelling [2].The AERIE model characterizes possible inferencetargets in terms of entities, activities and various relaistionships[3].A n entity, as in the ER-model,some thing that has existence and can be distinguished from other things. Entities are the nounsin the AERIE model. Activities are the verbs andthey indicate actions. Relationships are used to represent various associationsbetweenentities andactivities. Using the model to represent various possible inference targets, we have the following types oftargets:Entity Materialization: this represents an inferencethat detects the existence of an entity or some characteristics of an entity. An example of this type of inference would be to infer that the entity growing seasonis underway within a farming community based onthe nature of items that show up in a point-of-saledatabase, such as fertilizer, seed or pesticides.Activity Materialization: this represents an inference that detects the existence of an activity. Forexample, one can infer that a winter mountain climbing expedition is about ready to occur based on theordering of relevant equipment, such as an ice axe,cross country skis or low-temperature sleeping bags.Entity-EntityRelationship:this represents aninference of a relationship between two entities. Anexample is the ability to infer the companies that aresupporting a very sensitive project, based, for example,on an employee for the company attending a meetingfor the project.Activity-ActivityRelationship: this represents asensitive relationship between activities. For example,the fact that the activity of cotton picking hasoccurred can be deduced by the fact that a cottongin’s database shows daily ginning activity.Entity-ActivityRelationship: this represents aninference that detects a sensitive relationship betweenan entity and an activity. An example of this could beinferring that a company was adopting a new process

Computers & Security, Vol. 16, No. 8for the manufacture of computer chips based on itsordering of particular types of equipment.Relationship-RelationshipRelationship:in thistype of inference, it is the relationship between relationships that is being inferred. For example, in aclassroom setting, the posting of student grades alongwith a student number would be a relationship. Asorted list of student names would also be a relationship. A sensitive relationship between these two listscould be knowledge that the same sorting algorithmwas used in both cases. With this information onecould easily deduce the grade associated with eachstudent’s name, which is highly sensitive. Anothertype of relationship-relationshiptarget is the abilityto infer some rule (a type of relationship) that hasbeen applied to the data. For example, by scanningthe list of ages for members of a retirement community one could infer a rule that each member mustbe at least 55 years old.These various types of inference targets represent entities, activities and relationships that occur in the ld. This microworld selectively represents aportion of the real world that is relevant to the enterprise that maintainsthe database. Withinthemicroworld of the database, sensitive targets that are tobe protected by an enterprise are represented by whatwe call signatures.The signature represents the manifestation of the real world inference target within themicroworld of the database. For example, the ordersfor the mountain climbing expedition represent thesignatures of the expedition in the database of theequipment retailer.The relationship between the inference signature andtarget can be understood in terms of a model developedby Morgenstern [4, 51. This model uses an inferencefunction - INFER - which is defined in terms of theuncertainty, H(y), about the value of some informationY, and the relative uncertainty, II ,&), about Y, givenknowledge of X. Hais equal to 0 if X fully disclosesthe exact value of Y. This means that the uncertaintyof Y, given X, is 0; thus there is no uncertainty If X discloses no information about Y, then Hfl)is equal toH(Y).The infer function is defined as: 0othenviseIn this case, E is some minimum threshold, belowwhich X supplies what can be considered an insignificant amount of information about Y.In this model, the inference fimction INFER (X y)has a value related to how much information X discloses about Y. INFER has a value of 0 if X disclosesno information about Y. It has the value of 1 if X discloses the exact value of Y. In terms of the signatureswithin the database and the sensitive targets that are tobe protected, it would be desirable if INFER (signature target) 0 for all signatures that are containedwithin the database.Each of the various types of sensitive targets expressedin the AERIE model has a signature.The entity signature(E-sig) is a signature in the database that reflects the existence and characteristics of an entity that exists withinthe real world. For example, as noted for the entity materialization example, the growing season represents thereal world target.The database itself may not contain anyexplicit information about the growing season. However,it may contain an E-sig of the growing season, whichconsists of records of the sale of items that are normallypurchased at the beginning of the growing season.Examples might be seed and fertilizer. Of course, tobe able to perform this inference, an adversary wouldhave to have a knowledge base that included the signatures of all of the sensitive targets that were of interest.The activity signature (A-sig) represents the databasemanifestation of an activity that occurs in the realworld. For example, preparation for an attack could beindicated by database entries for material requisitionsand troop movements. When it is not important todifferentiate between an activity or entity signature,we can refer to a Q-sig, which represents either an Asig or an E-sig.The relationship signature (R-sig) represents the signatures in which the various types of relationships that689

T H. Hinke et al. / Protecting Databases From Inference Attacksmay exist in the real world are reflected within thedatabase. A particular type of R-sig is the second pathinference [6,7]. An example of second path inferenceis shown in Figure 1.This represents the real-world target that the identity of companies that are supportingcertain sensitive projects must not be disclosed.This isan example of an entity-entity sensitive target. In thisexample, the relationship between a project and thecompanies that support the project is considered to beclassified at level HIGH, as indicated by the dashedline in the figure.This sensitive target can be inferredat a lower classification level (LOW) by finding a second path that makes the association between company and project1One such second path shown in the figure is [project,meeting, visitor, company]. This path recognizes thepossibility of using a meeting attendee list to associateall of the companies for which the attendees workwith the classified project.As can be noted, while sucha path exists at the HIGH level (where it can be of nouse to a LOW cleared adversary), this path is not visible at the LOW level, as indicated by its dashed line.Thus, this does not provide an exploitable second pathat the LOW level.However, as can be noted, an exploitable LOW secondpath consisting of [project, escort, visitor, company]does exist. This example assumes a working environment in which all visitors are escorted by someemployee who has been designated as an escort for theparticular visitor. This path uses the project to whichthe escort charges his or her time as the basis for associating the visitor’s company with the project, andthus forming the classified association using onlyLOW classified data2Using this second path, the value of the INFER function would not equal one, since there could be some’ Note that while HIGH and LOW are used for the example. these can begeneralized to various types of hierarchically ordered security levels (e.g.tuxlassilied, confidential, secret or top secret) or various non-hierarchicalcategories (e.g. competition-sensitive, company Delta proprietary) thathave been used to label data within the US Department of Defense.’ One solution to this problem would be to polyinstamiate [8] the projectnumber, such that the escort would have a classitied project number and anunclassified project number. He would use the unclassified project numberfor escorting visitors. If the classified project were not tied to this unclasaified number, then the inference path would be broken.false associations. For example, assume that a visitorfrom the Alpha Company was to be escorted by anemployee who works only on Project Gamma. Usingthe second path inference signature, this would beviewed as a signature for the association between theAlpha Company and Project Gamma. However, if thevisitor was actually attending a meeting for ProjectOmega, this sensitive association would not be indicated by this second path. In terms of the INFERfunction, this means that the value of INFER of thissecond-path signature for this company-project association is less than one. It is also less than one due tothe fact that in many companies, people work on multiple projects. However, the fact that the value ofINFER is less than one does not mean that it is notvaluable.The AERIE project has identified the following threetiers of data within the database that can potentiallysupport a sensitive target: schema, group and instance.In a relational database, the schema consists of the definition of the relation tables and associated attributesthat are contained in each relation. Second-path inferences (such as the company-project inference that waspreviously discussed) are an example of a type of inference signature whose potential can be detected withschema-level analysis.The group tier consists of inference signatures thatinvolve properties about groups of data values. Forexample, knowledge that certain types of parts areunique to certain types of aircraft could be used toinfer that an airbase supports a particular type of aircraft, based on the nature of parts that are shipped tothe base. This tier can also be used for inferencesinvolving signatures that involve statistical propertiesof the data or correlations with various types of data.The instance tier represents those inference signaturesthat involve individual tuples of a relation in a relational database. For example, if the chairman of the BetaCompany is known to be performing secret negotiations with a foreign government, and the chairman’saircraft (identified by its tail number) is reported to havelanded in Iceland, then one can infer that Beta is negotiating with the Government of Iceland.This would bean example of an instance-tier inference.

Computers & Security, Vol. 16, No. 8Classified SecretSupports RelationshipProject---- ----------- --UnclassifiedUnclassifiedEscortFigure 1. Company-Project3. CharacterizationTechniquesof Database DesignThe inference vulnerabilities presented in the previoussection can be countered through the use of variousinference-orientateddesign guidelinesand techniques. To provide a context to understand how thevarious techniques relate to each other and wherethey fit into the continuum of techniques that may beavailable in the future, this section presents a numberof ways to characterize the techniques.UnclassifiedRelationshipInference Using Escortmodel of increasingly more sophisticated security guidelines is illustrated by the historical development ofguidelines and methods for removing security flawsfrom operating systemsThe following ordered list, provided by Marvin Schaefer [9] traces this historical development of trusted operating system design guidelinesand associated methodologies from the earliest stages tothe latest, most technologically advanced stages:1. Testing2. PenetrationThe first means of characterizing the techniques isbased on when in the database life-cycle the techniques can be applied. Those techniques that can beapplied to the database during database design arecalled proactive techniques. These techniques do notrequire that the database data be available; only theschema is required. Those techniques that can beapplied to existing database are called reactive techniques.These techniques can use the data instances fortheir analysis.The second means of characterizing design techniquesis based on the data tier to which the technique isapplied-The data tiers were introduced in the previoussection.The t&l means of characterizing design techniques isbased on the sophistication of the approach. A usefulUnclassifiedRelationshipHolds Meetingand patch3. Code review4. Automated analysis5. Applicationof fundamental principles6. Formal analysisThe initial design guideline for the development oftrusted operating systems was limited to just the normal testing of security features that is used for testingthe correct functioning of any system. When it wasrealized that this was not sufficient, security-orientated penetration testing was applied [lo]. These penetration techniques were then supplemented withmanual reviews of the code in an attempt to discoversoftware anomalies that would lead to potential pene-691

TH. Hinke et al. / Protecting Databases From Inference Attackstration vulnerabilities[lo]. There was also someresearch in automating the search for such flawsthrough automated code analysis [l l].The attempts todiscover flaws and patch existing operating systemswere replaced with the development of security principles that could be used to design systems frominception with security as a guiding design target [12].For those systems that were to have the highest levelof assurance, all of the security relevant software wasformally specified in a language that could then beformally verified to prove that - at least at the specification level - the system satisfied a desired security policy [12]. Even today, the actual verification of thecode is viewed as beyond the current state of the artfor real (as opposed to toy) systems [ 121.The third stage concerns the application of fundamental inference-preventionprinciples when thedatabase is designed. Examples of this taken from adatabase area other than inference, are the varioustypes of integrity constraints that are applied to relational databases (e.g. referential integrity). The fourthand final stage of inference analysis and guidance isformal analysis. An example of formal analysis, takenfrom a non-inference domain, is the use of functionaldependencies in database normalization to reduce dataredundancy Formal analysis derived horn an inferencedomain could involve the formalization of the semantics of the data and an automated analysis to removepotential inference vulnerabilities as that data is placedin a database.All of these operating

A database holds a great amount of data that is critical for the operation of enterprises, be they commercial or government.This data, while providing crucial sup- port for the mission of the enterprise, can also provide a source of sensitive information that is useful for .

Related Documents:

OWASP The OWASP Foundation

injection) Code injection attacks: also known as "code poisoning attacks" examples: Cookie poisoning attacks HTML injection attacks File injection attacks Server pages injection attacks (e.g. ASP, PHP) Script injection (e.g. cross-site scripting) attacks Shell injection attacks SQL injection attacks XML poisoning attacks

24 Views

1y ago

Stochastic Variational Inference for Dynamic Correlated ...

Stochastic Variational Inference. We develop a scal-able inference method for our model based on stochas-tic variational inference (SVI) (Hoffman et al., 2013), which combines variational inference with stochastic gra-dient estimation. Two key ingredients of our infer

28 Views

2y ago

Inference in Deep Gaussian Processes using Stochastic ...

2.3 Inference The goal of inference is to marginalize the inducing outputs fu lgL l 1 and layer outputs ff lg L l 1 and approximate the marginal likelihood p(y). This section discusses prior works regarding inference. Doubly Stochastic Variation Inference DSVI is

35 Views

2y ago

Instructions for using this template - APNIC

APNIC 46 Network security workshop, deployed 7 honeypots to a cloud service 21,077 attacks in 24 hours Top 5 sensors –training06 (8,431 attacks) –training01 (5,268 attacks) –training04 (2,208 attacks) –training07 (2,025 attacks) –training03 (1,850 attacks)

44 Views

2y ago

Detection of DDoS attacks using RNN-LSTM and Hybrid model ensemble.

Detection of DDoS attacks using RNN-LSTM and Hybrid model ensemble. Siva Sarat Kona 18170366 Abstract The primary concern in the industry is cyber attacks. Among all, DDoS attacks are at the top of the list. The rapid increase in cloud migration also increases the scope of attacks. These DDoS attacks are of di erent types like denial of service,

15 Views

1y ago

Cyber Security Expert

3 Cloud Computing Attacks a. Side channel attacks b. Service Hijacking c. DNS attacks d. Sql injection attacks e. Wrapping attacks f. Network sniffing g. Session ridding h. DOS / DDOS attacks 4 Securing Cloud computing a. Cloud security control layers b. Responsibilites in Cloud Security c. OWASP top 10 Cloud Security 5 Cloud Security Tools a.

11 Views

1y ago

7 databases 53 databases 58 databases 14 databases ...

14 databases History 183 databases ProQuest Primary Sources available for: Introduction ProQuest Historical Primary Sources Support Research, Teaching and Learning. Faculty and students are using a variety of resources in research, teaching and learning – including primary sources,

35 Views

3y ago

/LLO edag peq - BBMKU

Control Techniques, Database Recovery Techniques, Object and Object-Relational Databases; Database Security and Authorization. Enhanced Data Models: Temporal Database Concepts, Multimedia Databases, Deductive Databases, XML and Internet Databases; Mobile Databases, Geographic Information Systems, Genome Data Management, Distributed Databases .

9 Views

9m ago

Recent Views

Grammar as a Foreign Language - List of Proceedings

Grammar as a Foreign Language Oriol Vinyals Google vinyals@google.com Lukasz Kaiser Google lukaszkaiser@google.com Terry Koo Google terrykoo@google.com Slav Petrov Google slav@google.com Ilya Sutskever Google ilyasu@google.com Geoffrey Hinton Google geoffhinton@google.com Abstract Synta

2y ago

445 Views

Attention is All you Need - NIPS

Google Brain avaswani@google.com Noam Shazeer Google Brain noam@google.com Niki Parmar Google Research nikip@google.com Jakob Uszkoreit Google Research usz@google.com Llion Jones Google Research llion@google.com Aidan N. Gomezy University of Toronto aidan@cs.toronto.edu Łukasz Kaiser Google Brain lukaszkaiser@google.com Illia Polosukhinz illia .

1y ago

303 Views

GSA Implementation of Google (G) Suite

Google Meet Classic Hangouts Google Chat Google Calendar Google Drive and Shared Drive Google Docs Google Sheets Google Slides Google Forms Google Sites Google Keep Apps Script D

2y ago

316 Views

Google Drive (Google Docs, Google Sheets, Google Slides)

Google Drive (Google Docs, Google Sheets, Google Slides) Employees are automatically issued a Kyrene Google account. Navigate to drive.google.com. Use Kyrene email address and network password to login. Launch in Chrome browser for best experience. Google Drive is a cloud storage sys

2y ago

388 Views

Quick Guide of Using Google Home to Control Smart Devices

Configuration needs Google Home app. Search "Google Home" in App Store or Google Play to install the app. 3.1 Set up Google Home with Google Home app You can skip this part if your Google Home is already set up. 1. Make sure your Google Home is energized. 2. Open the Google Home app by tapping the app icon on your mobile device. 3.

1y ago

326 Views

Elaboração de Provas Online usando o Formulário Google Docs

2 Após o login acesse o Google Drive ou o Google Docs e selecione a ferramenta Google Forms (Formulários). Clique na caixa de Ferramentas do Google, localizada no canto direito superior da tela e selecione o Google Drive. Na tela do Google Drive clique em New , opção More e selecione Google Forms. OBS: É possível acessar o google

10m ago

123 Views

ACS WASC Templates

File upload, Folder upload, Google Docs, Google Sheets, or Google Slides. You can also create Google Forms, Google Drawings, Google My Maps, etc. Share with exactly who you want — without email attachments. Search or sort your list of files, folders, and Google Docs. Preview files and Google Docs.

2y ago

366 Views

Google Drive - San Bernardino City Unified School District

Google Apps All of the Google applications that are available upon logging into Google.com (G , Gmail, Gphotos, Gdrive, etc.). Google Suite Google’s online cloud based office companion applications (Docs, Sheets, Slides). Google Drive Google’s online cloud storage and file sharing/collaboration application.

2y ago

378 Views

Single Sign On for Google Apps with NetScaler Unified Gateway

Google Apps for Work is a suite of cloud computing productivity and collaboration applications provided by Google on a subscription basis. It includes Google’s popular web applications including Gmail, Google Drive, Google Hangouts, Google Calendar and Google

2y ago

295 Views

Serviceteil

Google 84, 87, 124 Google 110 Google AdWords 101, 103 Google Alerts 127 Google Analytics 89 Google Maps 100, 110, 173 Google-Maps 63 Google Places 100, 103, 124 Graphiken 66 H Haftung 170 Haftungsausschluss 72 Hausfarbe 11 Headline 35 Heilmittelwerbegesetz 14, 69, 163 Heilversprechen 164 HONcode 78 HTML 58 HWG 31 I Imagefilm 31

2y ago

336 Views

Best practices for managing identities when you move to Google Cloud

Google Cloud. To provide t he informat ion an organizat ion would ne e d to transfer data and ownership from one Google Account to anot her for s ome of t he noncore Google s er vice s, such as Google Ads, Google Analyt ics, or DV360. Intende d audience Organizat ion administrators. Sta planning Google Cloud / Google Wor kspace migrat ion. Key .

1y ago

481 Views

MANAGERIAL FINANCE - GBV

of Managerial Finance page 2 Introduction to Managerial Finance 1 Starbucks—A Taste for Growth page 3 1.1 Finance and Business What Is Finance? 4 Major Areas and Opportunities in Finance 4 Legal Forms of Business Organization 5 Why Study Managerial Finance? Review Questions 9 1.2 The Managerial Finance Function 9 Organization of the Finance

3y ago

6.8K Views

Chapter 1 The roles of finance function in organisations

The roles of the finance function in organisations 4. The role of ethics in the role of the finance function Ethics is the system of moral principles that examines the concept of right and wrong. Ethics underpins an organisation’s sustained value creation. The roles that the finance function performs should be carried out in an .File Size: 888KBPage Count: 10Explore furtherRole of the Finance Function in the Financial Management .www.managementstudyguide.c Roles and Responsibilities of a Finance Department in a .www.pharmapproach.comRoles and Responsibilities of a Finance Department .www.smythecpa.comTop 10 – Functions of Business Finance in an om23 Functions and Duties of Accounting and Finance nded to you b

1y ago

335 Views

Introduction - Google Earth User Guide

Google Earth Community: Learn from other Google Earth users by asking questions and sharing answers on the Google Earth Community forums. Using Google Earth: This blog describes how you can use some of the interesting features of Google Earth. Selecting a Server Note: This section is relevant to Google Earth Pro and EC users.

3y ago

288 Views

Using Google Forms to Manage Officials Signups

Google Sheets, deleting a response from the form or sheet will not affect the other. Once the Google Form is linked to a Google Sheet, clicking on the spreadsheet icon will open the linked Google Sheet. Google Responses Sheet Google automatically creates and populates the sp

2y ago

276 Views

Protecting Databases From Inference Attacks*

It looks like you're using an ad-blocker