[ale] wget oddity grabbing pages with URL parameters
pras at cycloeastern.com
pras at cycloeastern.com
Tue Jan 25 10:05:49 EST 2005
Dont know why that happens. However I have noticed that it also does not translate %20 to spaces.
On Tue, Jan 25, 2005 at 09:41:56AM -0500, George Carless wrote:
> Hi all,
>
> I'm having a little strange behaviour with wget... I'm trying to grab some pages from our
> CMS-based intranet site, using the following:
>
> wget -nH -r --level=1 -I valuemap --directory-prefix=/home/httpd/oursite/ http://oursite.com/valuemap/index.html
>
> The problem is, it's grabbing all of the pages, BUT when it does so it seems not to be grabbing any
> pages properly when they're in the format index.html?_function=detail&_op=1234 (which is what my
> CMS uses in this case). The pages ARE downloaded--I wind up with a bunch of files with the
> appropriate file names, but each one has the content of the MAIN index.html as it would return if
> no url parameters were passed. (i.e. my app returns different content depending on the value of
> $_GET["_function"], in PHP terminology).
>
> BUT if I wget the pages manually - e.g. "wget
> http://oursite.com/valuemap/index.html?_function=detail\&_op=1234" - the page is returned
> correctly.
>
> Any of you smart folks have any idea what might be happening here?
>
> Thanks,
> --George
>
> --------------------------------------
> George Carless ... kafka at antichri.st
> Words are just dust in deserts of sound
> _______________________________________________
> Ale mailing list
> Ale at ale.org
> http://www.ale.org/mailman/listinfo/ale
More information about the Ale
mailing list