DataFrame/sort - Maple Programming Help

Online Help

All Products    Maple    MapleSim


Home : Support : Online Help : Statistics and Data Analysis : DataFrames and DataSeries : DataSeries Commands : DataFrame/sort

DataFrame/sort

sort a DataFrame by values in a given column

 

Calling Sequence

Parameters

Options

Description

Examples

Compatibility

Calling Sequence

sort( DF, index )

sort( DF, index, ordering, options )

Parameters

DF

-

a DataFrame object

index

-

name, string or integer value; specifies the column index for the chosen column

ordering

-

symbol or Boolean function of two arguments; sort ordering

options

-

(optional) equation(s) of the form option=value

Options

• 

nonstrict: specifies to use a nonstrict comparison function type

• 

strict: specifies to use a strict comparison function type

• 

key: specifies to use a key comparison function type

• 

output: sorted or permutation; specify whether to return the sorted DataFrame or the permutation of rows that sorts the DataFrame accordingly

Description

• 

The sort command sorts the rows of a DataFrame according to the sorted values in a given column.

• 

If the ordering option is given, it specifies the ordering for sorting elements. By default, elements are sorted in ascending order for numeric values and lexicographic order for strings and names.

– 

`<`:  If ordering is the symbol `<` or numeric, then the DataFrame is sorted in ascending numerical order of values in a given column. The index column of the DataFrame must contain elements of type({numeric, real_infinity}),

– 

`>`:  If ordering is the symbol `>`, then the DataFrame is sorted into descending numerical order.

– 

address:  If ordering is the symbol address, then the elements are sorted by address (a non-deterministic run-time specific property of the underlying data structure).

– 

length:  If ordering is the symbol length, then the elements are sorted by length where length is as determined by the length function.

– 

lexorder:  If ordering is the symbol lexorder or string, then lists of strings or symbols are sorted into lexicographic order.

– 

Otherwise, ordering must be a Boolean-valued function, F, of two arguments.  Specifically, Fa&comma;b returns false if and only if b must precede a in the sorted output.  That is Fa&comma;b is a non-strict less than comparison function.  In addition, Fa&comma;b must be defined for all pairs a,b for a and b in the input structure and Fa&comma;b must be transitive, that is, if Fa&comma;b=true and Fb&comma;c=true then Fa&comma;c=true.

• 

By specifying comptype=Function, different styles of comparison functions can be given to sort.  The supported values for comptype are:

– 

nonstrict: the comparison Function is a non-strict less than function, as described above.

– 

strict: the comparison Function is a strict less than function.  That is Fa&comma;b returns true if and only if a must precede b in the sorted output.  Function must still be defined for all pairs of inputs and be transitive, as described above.  This argument is necessary if you want to specify a less than or equal to comparison function and want stable sorting.  Specifying a strict less than function without using the strict option will result in an non-stable, sorted output.

– 

key: the Function maps each element in the DataFrame to a key value.  The DataFrame is sorted by sorting the corresponding keys.  Using a key function is preferable to a comparison function because the key function is called On times, whereas a comparison function will be called Onlogn times.  This is generally faster.  In addition, sorting the keys may be done in parallel, whereas this may not be possible with a comparison function.

• 

If an argument output = sorted is supplied, sort returns the sorted argument. This is the default behavior.

  

If an argument output = permutation is supplied, then sort does not return the sorted argument, but the permutation that would be applied to the argument in order to sort it. The permutation is given as a list of integers: the ith entry of the permutation is the integer j such that the jth entry of DataFrame would occur at the ith position in the sorted argument. This means that if a=sortDataFrame&comma;ByColumn&comma;output=permutation then sortDataFrame could be obtained as DataFramea,...

  

In order to obtain both the sorted argument and the permutation, one can supply the argument output = [sorted, permutation]. This will return a sequence of two elements, the first being the sorted argument, the second the permutation.

Examples

Consider some data on berries.

genusRubus&comma;Vitis&comma;Fragaria&colon;

energy220&comma;288&comma;136&colon;

carbohydrates11.94&comma;18.1&comma;7.68&colon;

top_producerRussia&comma;China&comma;USA&colon;

berriesDataFrameenergy&verbar;carbohydrates&verbar;top_producer&verbar;genus&comma;columns&equals;Energy&comma;Carbohydrates&comma;`Top Producer`&comma;Genus&comma;rows&equals;Raspberry&comma;Grape&comma;Strawberry

berriesEnergyCarbohydratesTop ProducerGenusRaspberry22011.94RussiaRubusGrape28818.1ChinaVitisStrawberry1367.68USAFragaria

(1)

It is possible to sort by any column. Here the DataFrame is sorted in order of ascending energy level:

sortberries&comma;Energy

EnergyCarbohydratesTop ProducerGenusStrawberry1367.68USAFragariaRaspberry22011.94RussiaRubusGrape28818.1ChinaVitis

(2)

In order of descending carbohydrate level:

sortberries&comma;Carbohydrates&comma;`>`

EnergyCarbohydratesTop ProducerGenusGrape28818.1ChinaVitisRaspberry22011.94RussiaRubusStrawberry1367.68USAFragaria

(3)

It is also possible to sort columns with string and named values:

sortberries&comma;Genus

EnergyCarbohydratesTop ProducerGenusStrawberry1367.68USAFragariaRaspberry22011.94RussiaRubusGrape28818.1ChinaVitis

(4)

sortberries&comma;`Top Producer`&comma;length

EnergyCarbohydratesTop ProducerGenusStrawberry1367.68USAFragariaGrape28818.1ChinaVitisRaspberry22011.94RussiaRubus

(5)

The output option controls the returned output for the sort command. Using permutation returns the list of numeric indices that sort the original DataFrame.

index_ordersortberries&comma;Energy&comma;`>`&comma;output&equals;permutation

index_order2&comma;1&comma;3

(6)

berriesindex_order&comma;..

EnergyCarbohydratesTop ProducerGenusGrape28818.1ChinaVitisRaspberry22011.94RussiaRubusStrawberry1367.68USAFragaria

(7)

Several other sort options are also supported by the DataFrame sort command:

berriesAppendberries&comma;DataSeries1&comma;3&comma;2&comma;labels&equals;Raspberry&comma;Grape&comma;Strawberry&comma;label&equals;RandomValues

berriesEnergyCarbohydratesTop ProducerGenusRandomValuesRaspberry22011.94RussiaRubus1Grape28818.1ChinaVitis3Strawberry1367.68USAFragaria−2

(8)

sortberries&comma;RandomValues&comma;key&equals;abs

EnergyCarbohydratesTop ProducerGenusRandomValuesRaspberry22011.94RussiaRubus1Strawberry1367.68USAFragaria−2Grape28818.1ChinaVitis3

(9)

Compatibility

• 

The DataFrame/sort command was introduced in Maple 2017.

• 

For more information on Maple 2017 changes, see Updates in Maple 2017.

See Also

DataSeries/sort

sort