Sample Selector

Sample Selector is a tool for creating and editing samples, or groups of data you compare across—they're not "samples" in the statistical sense, but more like filters.

By default, a single sample exists: "All Data". With the Sample Selector, you can create new samples to organize your data.

You can use samples to:

A sample is composed of one or more filters, specific conditions that narrow down your sample.

Creating a sample

The general process for creating a sample is to:

The effect of multiple filters

DataShop interprets each filter after the first as an additional restriction on the data that is included in the sample. This is also known as a logical "AND". You can see the results of multiple filters in the sample preview as soon as all filters are "saved".

Read more about the Sample Selector

« Back to DataShop

About DataShop

The Pittsburgh Science of Learning Center (PSLC) DataShop is a data repository and web application for learning science researchers. It provides secure data storage as well as an array of analysis and visualization tools available through a web-based interface. Primarily, DataShop stores learner interactions from online course materials that include intelligent tutors. Data is collected from the seven PSLC courses: Algebra, Chemistry, Chinese, English, French, Geometry and Physics. There are also sources external to the PSLC that contribute to DataShop, such as middle school math data from the Assistments project (http://www.assistment.org).

DataShop can store any type of data associated with a course or study. This includes intelligent tutor data (which is capable of being analyzed through the analysis and visualization tools) as well as any related publications, files, presentations, or electronic artifacts a researcher would like to store. Courses and studies are represented as datasets, which are organized by project. An example of this is the “Algebra 1 2005-2006” dataset, which is grouped with similar datasets under the “Algebra Course” project. Semantic information can be stored for each dataset, providing additional context.

The amount of data in DataShop is constantly growing. The majority of datasets contain tutor interaction data, while others are files-only datasets, used solely for central and secure file storage. As of March 2008 there are about 8,700,000 tutor transactions, representing about 45,000 student hours available for analysis. Researchers have utilized DataShop to explore learning issues in a variety of educational domains. These include, but are not limited to, collaborative problem solving in Algebra (Rummel, Spada, Diziol, 2007), self-explanation in Physics (Hausmann & VanLehn , 2007), the effectiveness of worked examples and polite language in a Stoichiometry tutor (McLaren, Lim, Yaron, & Koedinger, 2007) and the optimization of knowledge component learning in Chinese (Pavlik, Presson, & Koedinger , 2007).

DataShop is designed to enable learning researchers to get off to a solid start analyzing their data.

To learn more about DataShop, explore the help pages here, start using the web application, or see our section on the PSLC website.

Version 3.0.22 July 1, 2008 LearnLab logo