Sorry, you lost me there :) What do you mean by "factor in the object transform"?
Here's what I found:
http://softimage.wiki.avid.com/index.php/Camera_(XSISDK)
In this article the author calculates the perspective matrix, gets the camera transformation matrix (much like what you said), and then calculates the point in screen space by:
Point(x`,y`,0) = Point(x,y,z) * Inverse_Of_Transformation_Matrix * Perspective_Matrix
What I tried was the following (after normalizing the screen-space point):
Point(x,y,z) = Point(x`,y`,0) * Inverse_Of(Inverse_Of_Transformation_Matrix * Perspective_Matrix)
However, the results were "disastrous" :) No matter what the input points were, the resulting world-space points all ended up (almost) on the same line!
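To make the round trip concrete, here is a minimal standalone sketch of what I'm trying to do, in plain Python rather than the XSI SDK. Everything in it is an assumption for illustration: the helper names (`perspective`, `project`, `unproject`), the OpenGL-style perspective matrix written for row vectors, and a camera that is only translated and looks down -Z. It keeps the homogeneous w component and divides by it in both directions, and it keeps the projected depth instead of forcing it to 0. I don't know whether skipping either of those is what breaks my version, but a 2D screen point on its own only pins down a ray through the scene, so the sketch also unprojects the same (x, y) at the near and far planes to get that ray.

```python
import math

def mat_mul(a, b):
    """4x4 matrix product a * b."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def vec_mul(v, m):
    """Row vector (length 4) times a 4x4 matrix."""
    return [sum(v[k] * m[k][j] for k in range(4)) for j in range(4)]

def invert(m):
    """Invert a 4x4 matrix by Gauss-Jordan elimination with partial pivoting."""
    n = 4
    a = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
         for i, row in enumerate(m)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        a[col], a[pivot] = a[pivot], a[col]
        p = a[col][col]
        a[col] = [x / p for x in a[col]]
        for r in range(n):
            if r != col:
                f = a[r][col]
                a[r] = [x - f * y for x, y in zip(a[r], a[col])]
    return [row[n:] for row in a]

def perspective(fov_y_deg, aspect, near, far):
    """Assumed OpenGL-style perspective matrix, laid out for row vectors."""
    f = 1.0 / math.tan(math.radians(fov_y_deg) / 2.0)
    a = (far + near) / (near - far)
    b = 2.0 * far * near / (near - far)
    return [[f / aspect, 0.0, 0.0, 0.0],
            [0.0, f, 0.0, 0.0],
            [0.0, 0.0, a, -1.0],
            [0.0, 0.0, b, 0.0]]

def project(point, view_proj):
    """World point -> normalized device coordinates (x', y', depth)."""
    h = vec_mul([point[0], point[1], point[2], 1.0], view_proj)
    return [h[0] / h[3], h[1] / h[3], h[2] / h[3]]  # perspective divide

def unproject(ndc, view_proj):
    """Normalized device coordinates (x', y', depth) -> world point."""
    h = vec_mul([ndc[0], ndc[1], ndc[2], 1.0], invert(view_proj))
    return [h[0] / h[3], h[1] / h[3], h[2] / h[3]]  # divide by w again

# Hypothetical camera: translated to (0, 0, 5), no rotation, looking down -Z.
camera = [[1.0, 0.0, 0.0, 0.0],
          [0.0, 1.0, 0.0, 0.0],
          [0.0, 0.0, 1.0, 0.0],
          [0.0, 0.0, 5.0, 1.0]]

# Same order as the formula: Inverse_Of_Transformation_Matrix * Perspective_Matrix
view_proj = mat_mul(invert(camera), perspective(45.0, 16.0 / 9.0, 0.1, 100.0))

world = [1.0, 2.0, -3.0]
ndc = project(world, view_proj)
recovered = unproject(ndc, view_proj)  # round trip, depth kept

# Without a depth value, the screen point only defines a ray:
near_pt = unproject([ndc[0], ndc[1], -1.0], view_proj)  # on the near plane
far_pt = unproject([ndc[0], ndc[1], 1.0], view_proj)    # on the far plane
```

In this sketch `recovered` matches `world` up to floating-point error, and the original point lies on the segment between `near_pt` and `far_pt`. The -1..1 depth range is a property of the assumed matrix, not something I've confirmed for the XSI camera.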
Also, the author says that the "script does not take into account shift-Z zooming", so I was wondering whether it would correctly handle all the other camera types (Orthographic, Telephoto…)