Statistical methods for reliably updatingmeta-analysesMark SimmondsCentre for Reviews and DisseminationUniversity of York, UKWith:Julian Elliott, Joanne McKenzie, Georgia Salanti,Adriani Nikolakopoulou, Julian Higgins
Updating meta-analyses When should we update a meta-analysis? When new studies emerge? When new data might alter ourconclusions? Updating is time-consumingCentre for Reviews and Dissemination
Some issues When can we stop updating? Which meta-analyses should have priority for updating? Conclusions can change over time– Risk of error if we stop too soonCentre for Reviews and Dissemination
Type I error Assuming an intervention is effectivewhen it isn’t Usually set at 5% Increases the more updates we perform Can we accept a conventionally“statistically significant” meta-analysis?Centre for Reviews and Dissemination
Cumulative meta-analysisIt works!OK, maybe notIt’s a failure!OK, maybe notCentre for Reviews and Dissemination
Type I error in meta-analysesCentre for Reviews and Dissemination
Type II error Assuming an intervention isn’t effectivewhen it is Not controlled in a meta-analysis When can we stop updatingnon-significant meta-analyses?Centre for Reviews and Dissemination
Cumulative meta-analysisDoesn’t look promisingGive up now?Definitely stop nowOh wait Centre for Reviews and Dissemination
A caveat The summary effect estimates (andconfidence intervals?) are valid at eachupdate Decisions made on the basis of theresults may not be– Particularly decisions about whether toupdateCentre for Reviews and Dissemination
Parallels with sequential trial design Aim to stop a trial as soon as possiblereview Select a desired Type I and II error rate And desired clinical effect Perform interim analyses throughout trialMetareviewCentre for Reviews and Dissemination
Key differences Meta-analysis is not controlled– No control over timing of studies– Size of studies Heterogeneity– Studies have different protocols– Estimated effects may not be consistentCentre for Reviews and Dissemination
Controlling error Control Type I and Type II error– Sequential meta-analysis– Trial sequential analysis Control Type I error– Law of Iterated Logarithm– “Shuster-Pocock” method Other methods––––Fully Bayesian analysisRobustness or stability of analysisConsequences of adding new studiesPower gains from adding new studiesCentre for Reviews and Dissemination
Example from CochraneI2 95%Centre for Reviews and Dissemination
Cumulative meta-analysisCentre for Reviews and Dissemination
Sequential meta-analysis (SMA)Higgins, Simmonds, Whitehead 2010 Calculate cumulative Z score andcumulative Information for each updatedmeta-analysis Stop when a pre-specified boundary iscrossed Boundary designed to control type I and IIerrorCentre for Reviews and Dissemination
Accounting for heterogeneity Select a prior estimate of heterogeneity– Generally assuming high heterogeneity Use Bayesian methods to calculate posteriorheterogeneity estimate at each update Use this Bayesian estimate in the updatedmeta-analysisCentre for Reviews and Dissemination
Sequential meta-analysisCentre for Reviews and Dissemination
Trial sequential analysis (TSA)Wetterslev, Thorlund, Brok, Gluud 2008 Select a required sample size for the metaanalysis Calculate alpha-spending boundaries Stop if Z score exceeds the boundary Or if sample size is reached Sample size must be adjusted forheterogeneityCentre for Reviews and Dissemination
ExampleCentre for Reviews and Dissemination
Law of Iterated Logarithm (LIL)Lan, Hu, Cappelleri 2007 Uses an adjusted Z statistic 𝑍 𝑍 𝜆 log log 𝑁 This is bounded as 𝑁 So controls Type I error Commonly sets 𝜆 2Centre for Reviews and Dissemination
ExampleCentre for Reviews and Dissemination
Shuster-Pocock methodShuster, Neu 2013 Compares the Z statistic to a t distribution Parameters of t distribution are based onPocock’s group sequential boundaries Must specify number of meta-analysesperformedCentre for Reviews and Dissemination
ExampleCentre for Reviews and Dissemination
76 Cochrane Reviews 76 Reviews: 286 meta-analysesBinary outcome194 (68%)Continuousoutcome92Stat. sig.178 (62%)Not stat. sig.108Trials per MAMedian 9IQR: 6 to 14Max: 200Effect size *Median 0.47If stat sig. 0.69If not 0.25I2 0: 32%I2 90%: 7.0%If stat sig. 46%If not: 13%I2* Log odds ratio or standardised mean differenceCentre for Reviews and Dissemination
Applying meta-analysis updating methods Apply to all 286 meta-analyses: “Naïve” cumulative meta-analysis Trial sequential analysis– (heterogeneity adjusted) Sequential meta-analysis– With no prior, 50% I2 and 90% I2 priors Law of iterated logarithm Shuster-PocockCentre for Reviews and Dissemination
Conclusions of analysesCentre for Reviews and Dissemination
Extra trials required to reach a conclusionCentre for Reviews and Dissemination
Realistic review updating Have assumed a new meta-analysis aftereach new trial In reality updates are less frequent First analysis will have good proportion oftotal trials Re-analyse assuming updates once 50%,70%, 90% and 100% of trials are availableCentre for Reviews and Dissemination
Conclusions using realistic updatingCentre for Reviews and Dissemination
Simulation study Simulated meta-analyses varying:– True treatment effect:– Number of studies:– Heterogeneity:0 or 0.15 to 50I2 0 to 90% Fixed total sample size of 9000– 90% power to detect effect of 0.1 if I2 50%Centre for Reviews and Dissemination
Methods applied Naïve analysis (standard cumulative MA) Trial Sequential Analysis (TSA) Sequential Meta-Analysis (SMA)– No prior heterogeneity– Prior I2 of 50% or 90% Law of Iterated Logarithm (LIL) Shuster methodCentre for Reviews and Dissemination
False positive rates – Type I error 20 trials / updates, I2 25%Centre for Reviews and Dissemination
False positive rates – Type I errorCentre for Reviews and Dissemination
Cumulative power 20 trials / updates, I2 25%Centre for Reviews and Dissemination
Cumulative powerCentre for Reviews and Dissemination
Conventional “Naïve” analysis Too many inappropriate positive conclusions– Elevated Type I error rate– But not vastly elevated for most updatedreviews? Biased estimates of effect Half of all analyses showing significantresults are based on too little evidence?Centre for Reviews and Dissemination
Trial Sequential Analysis Controls for Type I and II error Need to set desired effect Complex to run Required sample size varies with time– Can lead to inconsistent updatesCentre for Reviews and Dissemination
Sequential Meta-Analysis Controls for Type I and II error Need to set desired effectComplex to runStatistical information not intuitiveLimited choice of boundaries Bayesian heterogeneity too conservative? Not needed in practice?Centre for Reviews and Dissemination
Law of Iterated Logarithm Controls for Type I error Easy to implement Biased estimates of effect at stopping? Over-conservative: low-power Uncertainty over parameterCentre for Reviews and Dissemination
Shuster-Pocock Controls for Type I error Fairly easy to implement Needs more trials before stopping Need to pre-specify number of updates? Needs many studies to have adequatepowerCentre for Reviews and Dissemination
Do we need these methods? Is the problem with “naïve” analysis seriousenough in real Cochrane reviews? Do the methods needlessly delay a statisticallysignificant result? Too much focus on decision making overestimation? More complex than necessary?Centre for Reviews and Dissemination
When should they be implemented? At protocol stage in all reviews? At first update? Only once a statistically significant result isfound? Only when evidence is limited?– E.g. small total sample sizeCentre for Reviews and Dissemination
What are Cochrane reviews for? To present the best evidence at thecurrent time? To aid in making medical decisions orguiding future trials?Centre for Reviews and Dissemination
76 Cochrane Reviews 76 Reviews: 286 meta-analyses Centre for Reviews and Dissemination Binary outcome 194 (68%) Continuous outcome 92 Stat. sig. 178 (62%) Not stat. sig. 108 Trials per MA Median 9 IQR: 6 to 14 Max: 200 Effect size * Median 0.47 If stat sig. 0.69 If not 0.25 I2 I 2 0: 32% I 90%: 7.0% If stat sig. 46% If not: 13%
Bruksanvisning för bilstereo . Bruksanvisning for bilstereo . Instrukcja obsługi samochodowego odtwarzacza stereo . Operating Instructions for Car Stereo . 610-104 . SV . Bruksanvisning i original
10 tips och tricks för att lyckas med ert sap-projekt 20 SAPSANYTT 2/2015 De flesta projektledare känner säkert till Cobb’s paradox. Martin Cobb verkade som CIO för sekretariatet för Treasury Board of Canada 1995 då han ställde frågan
service i Norge och Finland drivs inom ramen för ett enskilt företag (NRK. 1 och Yleisradio), fin ns det i Sverige tre: Ett för tv (Sveriges Television , SVT ), ett för radio (Sveriges Radio , SR ) och ett för utbildnings program (Sveriges Utbildningsradio, UR, vilket till följd av sin begränsade storlek inte återfinns bland de 25 största
Hotell För hotell anges de tre klasserna A/B, C och D. Det betyder att den "normala" standarden C är acceptabel men att motiven för en högre standard är starka. Ljudklass C motsvarar de tidigare normkraven för hotell, ljudklass A/B motsvarar kraven för moderna hotell med hög standard och ljudklass D kan användas vid
LÄS NOGGRANT FÖLJANDE VILLKOR FÖR APPLE DEVELOPER PROGRAM LICENCE . Apple Developer Program License Agreement Syfte Du vill använda Apple-mjukvara (enligt definitionen nedan) för att utveckla en eller flera Applikationer (enligt definitionen nedan) för Apple-märkta produkter. . Applikationer som utvecklas för iOS-produkter, Apple .
Statistical Methods in Particle Physics WS 2017/18 K. Reygers 1. Basic Concepts Useful Reading Material G. Cowan, Statistical Data Analysis L. Lista, Statistical Methods for Data Analysis in Particle Physics Behnke, Kroeninger, Schott, Schoerner-Sadenius: Data Analysis in High Energy Physics: A Practical Guide to Statistical Methods
och krav. Maskinerna skriver ut upp till fyra tum breda etiketter med direkt termoteknik och termotransferteknik och är lämpliga för en lång rad användningsområden på vertikala marknader. TD-seriens professionella etikettskrivare för . skrivbordet. Brothers nya avancerade 4-tums etikettskrivare för skrivbordet är effektiva och enkla att
Den kanadensiska språkvetaren Jim Cummins har visat i sin forskning från år 1979 att det kan ta 1 till 3 år för att lära sig ett vardagsspråk och mellan 5 till 7 år för att behärska ett akademiskt språk.4 Han införde två begrepp för att beskriva elevernas språkliga kompetens: BI