Extreme Optimization™: Complexity made simple.

Math and Statistics
Libraries for .NET

  • Home
  • Features
    • Math Library
    • Vector and Matrix Library
    • Statistics Library
    • Performance
    • Usability
  • Documentation
    • Introduction
    • Math Library User's Guide
    • Vector and Matrix Library User's Guide
    • Data Analysis Library User's Guide
    • Statistics Library User's Guide
    • Reference
  • Resources
    • Downloads
    • QuickStart Samples
    • Sample Applications
    • Frequently Asked Questions
    • Technical Support
  • Order
  • Company
    • About us
    • Testimonials
    • Customers
    • Press Releases
    • Careers
    • Partners
    • Contact us
Introduction
Deployment Guide
Nuget packages
Configuration
Using Parallelism
Expand Mathematics Library User's GuideMathematics Library User's Guide
Expand Vector and Matrix Library User's GuideVector and Matrix Library User's Guide
Expand Data Analysis Library User's GuideData Analysis Library User's Guide
Expand Statistics Library User's GuideStatistics Library User's Guide
Expand Data Access Library User's GuideData Access Library User's Guide
Expand ReferenceReference

Skip Navigation LinksHome»Documentation»Data Analysis Library User's Guide

Data Analysis Library User's Guide

Extreme Optimization Numerical Libraries for .NET Professional

A data frame is a table of values suitable for data analysis. Formally, a data frame is a table-like structure made up of three components: a collection of vectors that contain the data for each column, a set of keys (called an index) to label the columns, and an index to label the rows.

Each column may have a different data type. Column values are immutable, but the collection of columns is not. This means that you can add and remove columns to and from a data frame. You can even replace a column by a new one with the same key. But the column vectors themselves can not change. This means, for example, that any operation that adds or removes rows returns a new data frame.

A data frame is very similar to a matrix. Both are two-dimensional tables that can have row and column indexes. Whereas vectors and matrices are ideally suited for computational tasks, the emphasis with data frames is on data manipulation and transformation. This different focus is reflected in the fact that vectors and matrices are generic over the type of the elements, while data frames are generic over the type of the row and column keys.

Overview

In this section:

  • Data Frames Indexes. How to create data frames. Operations on data frames. Importing and exporting.

  • Data wrangling The art of transforming data frames into a form suitable for processing by statistical and machine learning algorithms.

  • Grouping and Aggregation One of the core operations on data frames is the grouping and aggregation of data.

  • Working with Categorical Data Variables that can take on only a limited number of values deserve special treatment.

  • Working with Time Series Data Many data sets have observations that are indexed by a date or time value. This section discusses several enhancements that make working with such data sets more convenient.

Copyright (c) 2004-2023 ExoAnalytics Inc.

Send comments on this topic to support@extremeoptimization.com

Copyright © 2004-2023, Extreme Optimization. All rights reserved.
Extreme Optimization, Complexity made simple, M#, and M Sharp are trademarks of ExoAnalytics Inc.
Microsoft, Visual C#, Visual Basic, Visual Studio, Visual Studio.NET, and the Optimized for Visual Studio logo
are registered trademarks of Microsoft Corporation.