April 16th, 2014 | 9 Comments

Gnuplot comes with the possibility of plotting histograms, but this requires that the data in the individual bins was already calculated. Here, we start with an one dimensional set of data that we want to count and plot as an histogram, similar to the `hist()`

function we find in Octave.

In Fig. 1 you see two different distributions of measured angles. They were both given as one dimensional data and plotted with a defined macro that is doing the histogram calculation. The macro is defined in an additional file `hist.fct`

and loaded before the plotting command.

binwidth = 4 binstart = -100 load 'hist.fct' plot 'histogram.txt' i 0 @hist ls 1,\ '' i 1 @hist ls 2

The content of `hist.fct`

, including the definition of `@hist`

looks like this

# set width of single bins in histogram set boxwidth 0.9*binwidth # set fill style of bins set style fill solid 0.5 # define macro for plotting the histogram hist = 'u (binwidth*(floor(($1-binstart)/binwidth)+0.5)+binstart):(1.0) smooth freq w boxes'

For a detailed discussion on why `@hist`

calculates a histogram you should have a look at this discussion and the documentation about the smooth freq which basically counts points with the same x-value. The other settings in the file define the width of a single bin plotted as a box and its fill style.

It is important that the two values `binwidth`

and `binstart`

are defined before loading the `hist.fct`

file. These define the width of the single bins and at what position the left border of a single bin should be positioned. For example, let us assume that we want to have the bins centered around 0° as shown in Fig. 2. This can be achieved by settings the binstart to half the binwidth:

binwidth = 4 binstart = -2 load 'hist.fct' plot 'histogram.txt' i 0 @hist ls 1,\ '' i 1 @hist ls 2

Thanks for doing all this tutorials man!

Hi,

This website is just awesome !!! You’ve done a great job. Thanks for that.

But i wonder how to draw a gaussian on the graph (by using a fit curve) with only an one dimensional set of data ?

Thanks.

Arthur

Hi Arthur.

It is possible, but complicated. You can not use smooth together with fit. Hence, we first need to store our data to a temporary file and fit them afterwards. The every command is used to ignore the last line in the hist.txt file, which is not needed. This will then result in Fig. 3 at the end.

Fig. 3Distributions of measured angles and a fitted Gaussian distribution. (code to produce this figure, hist.fct, data)Hi,

Thanks a lot ! It is working very well. I didn’t know that it was possible to store the data. It will be very useful for my future plots.

I hope to see more amazing plots on your website, because I learned lots of things here.

Arthur

Your macro of course breaks down when plotting multiple, histograms with different binwidths or when you do exponential/logarithmic binning. Any nice ready-made solution for that?

I have some troubles with this histograms.

I have to write data on a file (because I have to make some curve fitting on part of this data), and what I get is the following file:

### beginning of gnuplot file

# Curve 0 of 1, 21 points

# Curve title: “name u (int(-log($4))):(1.)”

# x y xlow xhigh type

1 1 1 1 i

2 1 2 2 i

3 1 3 3 i

4 5 4 4 i

5 9 5 5 i

6 18 6 6 i

7 27 7 7 i

8 23 8 8 i

9 16 9 9 i

10 7 10 10 i

11 6 11 11 i

12 4 12 12 i

13 6 13 13 i

14 1 14 14 i

15 1 15 15 i

16 2 16 16 i

17 2 17 17 i

18 1 18 18 i

21 1 21 21 i

26 1 26 26 i

6 1 5.5 6.5 u

###EOF

I do not understand where this last line comes from. Any ideas?

PS: here are the files which have generated these frequancies:

https://dl.dropboxusercontent.com/u/10917026/toy.dat

https://dl.dropboxusercontent.com/u/10917026/stat.gp

Thanks for the article! I have another challenge, is it possible to plot multiple histograms with this method as stacked columns?

Thanks a lot for this code. One further question. How does one change the style to make the histogram colouring transparent?

Thanks for providing this script. It is very helpful.

I encountered a little problem when trying to plot a phase distribution in the range of -pi to pi. Easy copy & past will only work under the condition, that the number of bins has to be integer by the chosen binwidth. Otherwise you get artifacts (actual a bin to much at the right hand side).

I wrote a short improvement for such a chase:

#target binwidth

maxbinwidth = 0.25

#start point

binstart = -pi

#endpoint (rightmost edge of last bin)

binstopp = pi

#calculate number of bins (integer)

nobins = ceil( (binstopp-binstart)/maxbinwidth )

#actual binwidth

print ”

print ‘!!! ALERT !!!’

print nobins, ‘ bins with a width of ‘, binwidth, ‘ are used. This is a difference of ‘, binwidth-maxbinwidth, ‘ of target width.

print ”

then load hist-function.