In general, ATL is not a bad way to handle COM-intensive frameworks like DirectShow, but if you want a sample without it, take a look at
http://www.codeproject.com/KB/audio-video/VideoImageGrabber.aspx ("Real-time video image processing / frame grabber using a minimalistic approach")