methods

Estimating proportions for finite populations

Introduction In a previous post I talked about estimating a binomial proportion, including for rare events. The reason I wrote that was for background to this post. Here, we’ll again be looking at estimating proportions - and can include rare events - but with an added wrinkle: A population that is finite. In the Binomial case, every observation is assumed to be independent and have a fixed probability of the outcome of interest (a “success” or a “failure”).

Estimating binomial proportions for rare events

Introduction The common approach Common Approach: The math Why the common approach is bad Alternatives Alternative 1: Wilson and Agresi-Coull intervals Wilson interval: The math Alternative 2: Bayesian method Comparisons Summary Introduction Estimating a proportion gets covered in virtually every introductory statistics course, so why would I be writing a post about it? There are three reasons: One of my goals with these posts is to explain some basic statistical concepts.

One or two tails?

Introduction Should you use a two-tailed test or a one-tailed test (or similarly, a confidence interval or 1-sided confidence bound)? For those just learning statistics, or who have had only a little training in the subject, this question comes up fairly often. And there is some conflicting information and advice out there. Most often I’ve seen comments critical of one-sided methods, such as: The short answer is: Never use one tailed tests.

Robust estimation of reduced rank models to large spatial datasets

For large datasets, spatial covariances are often modeled using basis functions and covariance of a reduced dimensional latent spatial process. For skewed data, likelihood based approaches with Gaussian assumption may not lead to faithful inference. …

Threshold knot selection for large-scale spatial models with applications to the Deepwater Horizon disaster

Large spatial datasets are typically modelled through a small set of knot locations; often these locations are specified by the investigator by arbitrary criteria. Existing methods of estimating the locations of knots assume their number is known a …

Determining the number of test fires needed to represent the variability present within firearms of various calibers

The Association of Firearm and Toolmark Examiners recommends a minimum of two test fires be performed when an unknown firearm is submitted to a laboratory prior to doing a comparison with a cartridge case collected from a crime scene. Limited …

Testing for Inequality Constraints in Singular Models by Trimming or Winsorizing the Variance Matrix

There are many applications in which a statistic follows, at least asymptotically, a normal distribution with a singular or nearly singular variance matrix. A classic example occurs in linear regression models under multicollinearity but there are …

Determining the number of test fires needed to represent the variability present within 9mm Luger firearms

Many studies have been performed in recent years in the field of firearm examination with the goal of providing an objective method for comparisons of fired cartridge cases. No published research to support the number of test fires needed to …

CLME An R package for linear mixed effects models under inequality constraints

In many applications researchers are typically interested in testing for inequality constraints in the context of linear fixed effects and mixed effects models. Although there exists a large body of literature for performing statistical inference …

A flexible class of reduced rank spatial models for large non-Gaussian datasets

In environmental studies, the datasets exhibiting non-Gaussian properties, such as heavier or lighter tails and multimodality, are very common. The research on dealing with such datasets in reduced rank perspectives is very limited. In this chapter, …