I am trying to create a multipage pdf of about 750 different graphs.

Each graph has around 5,000 - 15,000 data points, giving me roughly 7

million points across the pdf. I make it in a large pdf with a page length

of about 20 inches, and then plot about 10 graphs to a page. So I end up

with basically 75 pages in my pdf. I'm basically trying to graph a line of

XY data points.

The problem, is the pdf is unbearably slow when plotting as a scatter plot

or as a line with markers.

If I make a regular line plot, with no markers, just a single line, it is

plotted and the pdf is fine. But then it connects my points which I don't

want.

I assume this is all because its making the pdf in vector format. And when

I convert it to single lines, I only have ~750 line vectors. But when I try

to scatter plot, or line plot with markers, I end up with millions of

vectors.

I've tried the 'rasterized=True' and that definitely works. But the quality

is really bad. I need to be able to zoom in close on the pdf and still see

rough resolution of the points.

For clarity, I don't actually need to see each individual points. The

graphs have two lines on them, and I just need to be able to distinguish

between the two lines. The two lines are just made up of thousands of

points each.

Is there anyway to keep scalable vectors and do this? Or will I just be

forced to go to a rasterized image file in order to load the pdf in a

reasonable time.

Thanks.

## ···

--

View this message in context: http://matplotlib.1069221.n5.nabble.com/Millions-of-data-points-saved-to-pdf-tp43338.html

Sent from the matplotlib - users mailing list archive at Nabble.com.