I am trying to create a multipage pdf of about 750 different graphs.
Each graph has around 5,000 to 15,000 data points, giving me roughly 7
million points across the pdf. I use a large page, about 20 inches long,
and plot about 10 graphs per page, so I end up with about 75 pages in the
pdf. Each graph is essentially a line of XY data points.
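To make the setup concrete, here is a minimal sketch of the layout described above, assuming matplotlib's PdfPages is used to write the pages. The function name write_pages, its parameters, the 8.5 inch page width, and the random data are placeholders for illustration, not my real script or data.

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, no display needed
import matplotlib.pyplot as plt
from matplotlib.backends.backend_pdf import PdfPages


def write_pages(target, n_graphs=20, per_page=10, n_points=500, seed=0):
    """Write n_graphs marker-only plots to a multipage pdf, per_page per page.

    target may be a filename or a writable binary file object.
    Returns the number of pages written.
    """
    rng = np.random.default_rng(seed)
    n_pages = 0
    with PdfPages(target) as pdf:
        for start in range(0, n_graphs, per_page):
            count = min(per_page, n_graphs - start)
            # One tall page (about 20 inches) with the graphs stacked vertically.
            fig, axes = plt.subplots(count, 1, figsize=(8.5, 20), squeeze=False)
            for ax in axes[:, 0]:
                # Placeholder XY data standing in for one real graph.
                x = np.sort(rng.uniform(0.0, 1.0, n_points))
                y = rng.normal(size=n_points)
                # Markers only, no connecting line: every point becomes
                # its own vector object in the pdf.
                ax.plot(x, y, linestyle="none", marker=".", markersize=2)
            pdf.savefig(fig)
            plt.close(fig)  # release the figure before building the next page
            n_pages += 1
    return n_pages
```

Calling write_pages("graphs.pdf", n_graphs=750, per_page=10, n_points=10000) would approximate the full case (75 pages, several million markers); the defaults are deliberately small.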
The problem is that the pdf is unbearably slow when the data is plotted as
a scatter plot or as a line with markers.
If I make a regular line plot, with no markers, just a single line, it
plots and the pdf is fine. But then it connects my points, which I don't
want.
I assume this is all because it's making the pdf in vector format. When I
plot single lines, I only have ~750 line vectors. But when I try to
scatter plot, or line plot with markers, I end up with millions of
vectors.
I've tried 'rasterized=True' and that definitely works, but the quality
is really bad. I need to be able to zoom in close on the pdf and still see
rough resolution of the points.
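For reference, a minimal sketch of the rasterized attempt. It assumes, as I understand the savefig documentation, that the dpi passed to savefig sets the resolution at which rasterized artists are rendered inside a vector file, so a higher value trades file size for zoom quality. The function name save_rasterized, the default of 600 dpi, and the synthetic two-line data are illustrative assumptions.

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, no display needed
import matplotlib.pyplot as plt


def save_rasterized(target, dpi=600, n_points=5000, seed=0):
    """Save two marker-only series as rasterized artists in a pdf.

    Axes, ticks and labels stay vector; only the markers are embedded
    as an image rendered at the given dpi.
    """
    rng = np.random.default_rng(seed)
    fig, ax = plt.subplots(figsize=(8.5, 2))
    x = np.linspace(0.0, 1.0, n_points)
    for offset in (0.0, 1.0):
        # Two noisy series standing in for the two lines on each graph.
        y = offset + rng.normal(scale=0.1, size=n_points)
        ax.plot(x, y, linestyle="none", marker=".", markersize=2,
                rasterized=True)
    # dpi here controls the resolution of the rasterized markers.
    fig.savefig(target, format="pdf", dpi=dpi)
    plt.close(fig)
```

Whether a high enough dpi gives acceptable zoom without bloating a 75 page file is exactly what I am unsure about.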
For clarity, I don't actually need to see each individual point. The
graphs have two lines on them, and I just need to be able to distinguish
between the two lines. The two lines are just made up of thousands of
points.
Is there any way to keep scalable vectors and do this? Or will I just be
forced to go to a rasterized image file in order to load the pdf in a
reasonable time?
View this message in context: http://matplotlib.1069221.n5.nabble.com/Millions-of-data-points-saved-to-pdf-tp43338.html
Sent from the matplotlib - users mailing list archive at Nabble.com.