These are my links for May 27th through July 28th:
- Statistical Modeling: The Two Cultures – There are two cultures in the use of statistical modeling to reach conclusions from data. One assumes that the data are generated by a given stochastic data model. The other uses algorithmic models and treats the data mechanism as unknown. The statistical community has been committed to the almost exclusive use of data models. This commitment<br />
has led to irrelevant theory, questionable conclusions, and has kept statisticians from working on a large range of interesting current problems. Algorithmic modeling, both in theory and practice, has developed rapidly in fields outside statistics. It can be used both on large complex data sets and as a more accurate and informative alternative to data modeling on smaller data sets. If our goal as a field is to use data to solve problems, then we need to move away from exclusive dependence on data models and adopt a more diverse set of tools.
- Brief history of data visualization – Data visualization is a pretty literal term that means, quite simply, the visual representation of quantitative data. In this course we’ll learn common techniques for visualizing data, as well as some strategies for managing information digitally. But first, a brief history.
- S. Thompson. Motif-index of folk-literature – a classification of narrative elements in folktales, ballads, myths, fables, mediaeval romances, exempla, fabliaux, jest-books, and local legends.
- What is data science? – O’Reilly Radar – We’ve all heard it: according to Hal Varian, statistics is the next sexy job. Five years ago, in What is Web 2.0, Tim O’Reilly said that “data is the next Intel Inside.” But what does that statement mean? Why do we suddenly care about statistics and about data?<br />
<br />
In this post, I examine the many sides of data science — the technologies, the companies and the unique skill sets.
- [1005.0437] A Unifying View of Multiple Kernel Learning – Recent research on multiple kernel learning has lead to a number of approaches for combining kernels in regularized risk minimization. The proposed approaches include different formulations of objectives and varying regularization strategies. In this paper we present a unifying general optimization criterion for multiple kernel learning and show how existing formulations are subsumed as special cases. We also derive the criterion’s dual representation, which is suitable for general smooth optimization algorithms. Finally, we evaluate multiple kernel learning in this framework analytically using a Rademacher complexity bound on the generalization error and empirically in a set of experiments.
These are my links for April 12th through April 22nd:
- Gilbert Harman, Sanjeev Kulkarni – Reliable Reasoning: Induction and Statistical Learning Theory – Reviewed by Kevin Kelly, and Conor Mayo-Wilson, Carnegie Mellon University – Philosophical Reviews – University of Notre Dame – Harman and Kulkarni’s Reliable Reasoning is a welcome attempt to relate machine learning to the philosophy of induction at an introductory level suitable for undergraduates or for professional philosophers and scientists who desire a painless introduction to the subject. In clear, helpful figures and engagingly informal prose, the slender volume summarizes the main results and concepts of statistical learning theory, a particular statistical framework developed by V. N. Vapnik, A. J. Chervonenkis and others for machine learning and other applications (Vapnik and Chervonenkis 1974, Vapnik 1998, 1999, 2000). Harman and Kulkarni also explain how statistical learning theory might shed light on philosophical topics like the problem of induction and the role of simplicity in theory choice. The book is mainly an informal précis of Vapnik (2000), with some important differences.
- Independence and Large Cardinals (Stanford Encyclopedia of Philosophy) – The independence results in arithmetic and set theory led to a proliferation of mathematical systems. One very general way to investigate the space of possible mathematical systems is under the relation of interpretability. Under this relation the space of possible mathematical systems forms an intricate hierarchy of increasingly strong systems. Large cardinal axioms provide a canonical means of climbing this hierarchy and they play a central role in comparing systems from conceptually distinct domains.
- The Sources of Normativity (application/pdf Object) – Whether this is true or not, the moral philosophy of the modern period can be read as a search for the source of normativity. Philosophers in the modern period have come up with four successive answers to the question of what makes morality normative. In brief, they are these: (1) Voluntarism, (2) Realism, (3) Reflective Endorsement, ( 4 ) The Appeal to Autonomy.
- Earliest Known Uses of Some of the Words of Mathematics – These pages attempt to show the first uses of various words used in mathematics. Research for these pages is ongoing, and a citation should not be assumed to be the earliest use unless it is indicated as such.
These are my links for March 12th through March 23rd:
These are my links for September 14th through September 22nd:
- Philosophy Now | Daniel Dennett: Autobiography (Part 1) – What makes a philosopher? In the first of a two-part mini-epic, Daniel C. Dennett contemplates a life of the mind – his own. Part 1: The pre-professional years.
- Philosopher’s Annual – Our goal is to select the ten best articles published in philosophy each year—an attempt as simple to state as it is admittedly impossible to fulfill. Against a background of twenty-four volumes in hard copy, the Annual is now available entirely online.
- Revolutions: Interactive stock visualizations with R – Jeroen Ooms, who recently completed his Masters in Statistics at Utrech University, has created an outstanding web-based drag-and-drop application for visualizing financial data. With his “StockPlot” t application, you can select any stock from a number of world exchanges (including NASDAQ, DAX, FTSE), and drag it to a worksheet to see a time-series of the stock price. You can arrange up to four charts on the same worksheet for comparison purposes, and control the timeframe and appearance of each chart.
- Revolutions: Machine Learning in R, in a nutshell – Josh Reich has created a concise R script demonstrating various machine-learning techniques in R with simple, self-contained examples.
- Information Processing and Thermodynamic Entropy (Stanford Encyclopedia of Philosophy) – Are principles of information processing necessary to demonstrate the consistency of statistical mechanics? Does the physical implementation of a computational operation have a fundamental thermodynamic cost, purely by virtue of its logical properties? These two questions lie at the centre of a large body of literature concerned with the Szilard engine (a variant of the Maxwell’s demon thought experiment), Landauer’s principle (supposed to embody the fundamental principle of the thermodynamics of computation) and possible connections between the two. A variety of attempts to answer these questions have illustrated many open questions in the foundations of statistical mechanics.
- Christopher J. G. Meacham, Two Mistakes Regarding The Principal Principle | PhilPapers – This paper examines two mistakes regarding David Lewis’ Principal Principle that have appeared in the recent literature. These particular mistakes are worth looking at for several reasons: the thoughts that lead to these mistakes are natural ones, the principles that result from these mistakes are untenable, and these mistakes have led to significant misconceptions regarding the role of admissibility and time. After correcting these mistakes, the paper discusses the correct roles of time and admissibility. With these results in hand, the paper concludes by showing that one way of formulating the chance-credence relation has a distinct advantage over its rivals.
- José Luis Bermúdez – Decision Theory and Rationality – Reviewed by Lara Buchak, UC Berkeley – Philosophical Reviews – University of Notre Dame – Decision theory is used for a variety of purposes: decision makers use it to guide their own actions, and theorists use it both normatively to assess decision makers and to predict and explain their decisions. This book investigates whether the theory can fulfill all three of these purposes. In particular, Bermúdez explores three questions that decision theory must answer under any guise: How should we understand utility and preference? How finely should we individuate the possible outcomes in a decision problem? And how should choice be constrained over time? He argues that there are no answers to these questions that allow decision theory to serve all three purposes.
These are my links for July 6th through July 8th:
- How to choose a statistical test – This book has discussed many different statistical tests. To select the right test, ask yourself two questions: What kind of data have you collected? What is your goal? Then refer to Table 37.1.
- NPWRC :: Statistical Significance Testing – Four basic steps constitute statistical hypothesis testing. First, one develops a null hypothesis about some phenomenon or parameter. This null hypothesis is generally the opposite of the research hypothesis, which is what the investigator truly believes and wants to demonstrate. Research hypotheses may be generated either inductively, from a study of observations already made, or deductively, deriving from theory. Next, data are collected that bear on the issue, typically by an experiment or by sampling. (Null hypotheses often are developed after the data are in hand and have been rummaged through, but that’s another topic.)
- Data Mining Techniques – Data Mining is an analytic process designed to explore data (usually large amounts of data – typically business or market related) in search of consistent patterns and/or systematic relationships between variables, and then to validate the findings by applying the detected patterns to new subsets of data. The ultimate goal of data mining is prediction – and predictive data mining is the most common type of data mining and one that has the most direct business applications.
- An Overview of Data Mining Techniques – This overview provides a description of some of the most common data mining algorithms in use today. We have broken the discussion into two sections, each with a specific theme:* Classical Techniques: Statistics, Neighborhoods and Clustering
* Next Generation Techniques: Trees, Networks and Rules
Each section will describe a number of data mining algorithms at a high level, focusing on the “big picture” so that the reader will be able to understand how each algorithm fits into the landscape of data mining techniques. Overall, six broad classes of data mining algorithms are covered. Although there are a number of other algorithms and many variations of the techniques described, one of the algorithms from this group of six is almost always used in real world deployments of data mining systems.
- MachineLearning.pdf (application/pdf Object) – Over the past 50 years the study of Machine Learning has grown from the efforts of a handful of computer engineers exploring whether computers could learn to play games, and a field of Statistics that largely ignored computational considerations, to a broad discipline that has produced fundamental statistical-computational theories of learning processes, has designed learning algorithms that are routinely used in commercial systems
for speech recognition, computer vision, and a variety of other tasks, and has spun off an industry in data mining to discover hidden regularities in the growing volumes of online data. This document provides a brief and personal view of the discipline that has emerged as Machine Learning, the fundamental questions it addresses, its relationship to other sciences and society, and where it might be headed
|
|