[Author Prev][Author Next][Thread Prev][Thread Next][Author Index][Thread Index]

Re: [pygame] Python - Pygame - PyOpenGL performance

To: pygame-users@xxxxxxxx
Subject: Re: [pygame] Python - Pygame - PyOpenGL performance
From: Zack Schilling <zack.schilling@xxxxxxxxx>
Date: Mon, 16 Mar 2009 13:49:01 -0400
Delivered-to: archiver@xxxxxxxx
Delivered-to: pygame-users-outgoing@xxxxxxxx
Delivered-to: pygame-users@xxxxxxxx
Delivery-date: Mon, 16 Mar 2009 13:49:07 -0400
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:from:to :in-reply-to:content-type:content-transfer-encoding:mime-version :subject:date:references:x-mailer; bh=FfzijtbAtx0zhDFzaEZoARCNzeL54cszJ+4ouLopkjs=; b=LvXcWuJy0Qb7DUgUconlUs/nVNu0IPfFmbRi1hjZg58Ls34EVdSlrKzSbgxVUNP0di RIxGT4v6PoJWNNaeCxGiOtxut56WbueyyaqL/Jsdxm0A3/1ph+8FINF9Q5VmiLKiUZo4 ANWbOFPqOAW9t/c24/eXBKIVuspxZurcWL6EQ=
Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:from:to:in-reply-to:content-type :content-transfer-encoding:mime-version:subject:date:references :x-mailer; b=MDJP3doyfON5nJ/d4fDDE/Hn36kV/Nkbp3ztXguZJg/lljAkgr1AZPCOzfK2J7nl8l FOEZmzFxiDkH/ziYPP6hBc1OXWT/ttCR3F4FIqUbAEn88Q4p7CatTA2vihZLAUElrcxj B1AaPWr71YY38NZUyu1wT24MmADGGhMg1wTY4=
In-reply-to: <b572c9e10903161000y5f794490s81ba985523eea418@xxxxxxxxxxxxxx>
References: <F0CF2F90-6467-4392-ABFB-AEFA89C35695@xxxxxxxx> <138600AC-8F34-4C8E-9F89-EA3B70EC061D@xxxxxxxxxxx> <20090227151929.GA2588@home> <b572c9e10903161000y5f794490s81ba985523eea418@xxxxxxxxxxxxxx>
Reply-to: pygame-users@xxxxxxxx
Sender: owner-pygame-users@xxxxxxxx

If someone did this and I could drop it in to my code, that would bevery nice. But for right now, PyOpenGL is serving my needs just fine.I can use about 600 independently textured and animated spritesonscreen, scaled and rotated, without stressing a low-end system morethan 40%.

40% is a significant amount of overhead, but Peter is wrong of a fewpoints. You certainly can animate a sprite in OpenGL using texturecoords. Just load all your animation frames, convert to strings, stickthem all together, and pass to OpenGL as one very tall texture. Thisworks perfectly fine. That means VBOs are definitely suitable. You canalso push a VBO up piecemeal, changing the active texture betweenparts (and achieving the expected effect).


Everything you see here is done with pygame and PyOpenGL: http://www.youtube.com/watch?v=cBFoXqKrBa8

Positioning the quads directly doesn't seem to be too much of anissue, since in my game, they move each frame anyway. The cost ofadding coordinates in Python and pushing them into a numpy array ismuch less than an OpenGL push, translate/rotate, pop call for each andevery sprite. It makes a lot of sense to me that this would be thecase in other languages as well.


-Zack

On Mar 16, 2009, at 1:00 PM, Forrest Voight wrote:

Would writing a replacement for PyOpenGL in C instead of in Python
with ctypes help? I think it really would ... PyOpenGL is internally
pretty complex, sometimes when I get tracebacks the error is 5 or 6
levels into PyOpenGL. Even a C library that only implemented the
common functions and relied on PyOpenGL for the constants and
functions that do complex things like handling strings would probably
help a lot.

On Fri, Feb 27, 2009 at 11:19 AM, Peter Gebauer
<peter.gebauer@xxxxxxxxxxxxxxxxxxxxx> wrote:
Hi!

I've done a few sprite thingies in OpenGL here are some pointers:

Afaik display lists and VBO's can't bind different textures (?)
per list/array. You can't animate lists by changing texcoords
independently per element, so no go. VBO's have texture coords,
but only one texture. Again, I'm no expert, might be wrong.

With the quad aproach you should try
to make the number of calls as few as possible. If you get
rid of the push and translate for each sprite you'll get some
extra speed. Try positioning each quads directly. The downside
with sharing matrix over all sprites is the obvious lack of
using OpenGL transformations, but some vector math aplied to
the quads has been faster for me than having one transformed
matrix per quad.

Since I haven't been able to animate a list/vbo with independent
textures and texture coords for each element/buffer object I've only
used it for backdrops. The speed increase is tremendous.
I also partition the elements so only one list/vbo is displayed per
visible section, if you're screen display is smaller than the
entire scene, this helps even more.

If you put all your sprites and their animation frames into one
big texture you could use VBO's, but I've never had the tenacity
to try that aproach.

Another way to increase speed is to write an opengl rendering engine
in C and call and make it available as a Python extension. This is
a major speed boost, in particular for a large number of iterations.
Iirc PyOpenGL bindings are generated, many times this is suboptimal
code for what you're trying to do, writing the Python extension in C
manually have been faster for me many times. This is indeed true
if you put your iterations inside a C loop instead of calling the
C function from Python many times.

In any case, still waiting for that OO 2D game engine with tons of
OpenGL features and effects, including simple things like frameanimation,LERP-like features and a simple 2D scenegraph. No luck yet, allattempts
I've tried so far lack at least one "must have" feature. :)

/Peter

On 2009-02-26 (Thu) 11:29, Casey Duncan wrote:
Immediate mode calls (glVertex et al) are the very slowest way touseOpenGL. In fact they are deprecated in OpenGL 3.0 and willeventually be
removed.
The display list is better as you discovered, but you still aremaking afew OpenGL state changes per sprite, which is likely slowing youdown.Also there is some overhead for the display list call, which makesthem
sub-optimal for just drawing a single quad.
       glPushMatrix()
       glTranslate(self.positionx,self.positiony,0)
       glCallList(self.displist)
       glPopMatrix()
You really need to batch the quads up into a few vertex arrays orvbos
to stream them to the card in one go. pyglet has a high-level python
sprite api that automates this for you fwiw.

-Casey

On Feb 26, 2009, at 11:04 AM, Zack Schilling wrote:
I know the PyOpenGL mailing list might be a better place to askthisquestion, but I've had a lot of luck talking to the experiencedpeople
here so I figured I'd try it first.

I'm trying to migrate a game I created from using the Pygame / SDL
software rendering to OpenGL. Before attempting the massive and
complex conversion involved with moving the whole game, I decidedto
make a little test program while I learned OpenGL.
In this test, I set up OpenGL to work in 2D and began loadingimagesinto texture objects and drawing textured quads as sprites. Icreated alittle glSprite class to handle the drawing and translation. Atfirst
its draw routine looked like this:

       glPushMatrix()
       glTranslate(self.positionx,self.positiony,0)
       glBindTexture(GL_TEXTURE_2D, self.texture)
       glBegin(GL_QUADS)
       glTexCoord2f(0, 1)
       glVertex2f(0, 0)
       glTexCoord2f(1, 1)
       glVertex2f(w, 0)
       glTexCoord2f(1, 0)
       glVertex2f(w, h)
       glTexCoord2f(0, 0)
       glVertex2f(0, h)
       glEnd()
       glPopMatrix()
Note: self.texture is a texture ID of a loaded OpenGL textureobject.My sprite class keeps a dictionary cache and only loads thesprite's
image into a texture if it needs to.
I'd get maybe 200 identical sprites (same texture) onscreen andmy CPUwould hit 100% load from Python execution. I looked into whatcould becausing this and found out that it's probably function calloverhead.
That's 14 external library function calls per sprite draw.
The next thing I tried was to create a display list at eachsprite's
initialization. Then my code looked like this:
       glPushMatrix()
       glTranslate(self.positionx,self.positiony,0)
       glCallList(self.displist)
       glPopMatrix()
Well, that's nice, down to 4 calls per draw. I was able to push~500sprites per frame using this method before the CPU tapped out. Ineedmore speed than this. My game logic uses 30-40% of the CPU aloneandI'd like to push at least 1000 sprites. What can I do? I'velooked intopassing sprites as a matrix with vertex arrays, but forming apropervertex array with numpy can sometimes be more trouble than it'sworth.
Plus, I can't swap out textures easily mid-draw, so it makes things
much more complex than the simple way I'm doing things now.
Is there any design pattern I could follow that will get me morespeed
without sending me off the deep end with complexity.

Thanks,

Zack

Follow-Ups:
- Re: [pygame] Python - Pygame - PyOpenGL performance
  - From: Brian Fisher
- Re: [pygame] Python - Pygame - PyOpenGL performance
  - From: Forrest Voight

References:
- Re: [pygame] Python - Pygame - PyOpenGL performance
  - From: Forrest Voight

Prev by Author: Re: [pygame] Weird lag
Next by Author: Re: [pygame] GSoC Easy simple software 3d
Previous by thread: Re: [pygame] Python - Pygame - PyOpenGL performance
Next by thread: Re: [pygame] Python - Pygame - PyOpenGL performance
Index(es):
- Author
- Thread