Advanced search  

News:

cpg1.5.48 Security release - upgrade mandatory!
The Coppermine development team is releasing a security update for Coppermine in order to counter a recently discovered vulnerability. It is important that all users who run version cpg1.5.46 or older update to this latest version as soon as possible.
[more]

Pages: [1]   Go Down

Author Topic: weird url showing up  (Read 4769 times)

0 Members and 1 Guest are viewing this topic.

Walkinman

  • Coppermine frequent poster
  • ***
  • Offline Offline
  • Gender: Male
  • Posts: 373
    • Skolai Images - Nature, Travel and Adventure stock photos
weird url showing up
« on: April 21, 2012, 09:34:11 pm »

Hello

In my looking around to find why my cup usage is higher than it should be, I'm finding a lot of crawlers are hitting urls like this

http://example.com/stock/thumbnails-18-Whitetail-Deer-Photos.htmlhttp:/css/themes/water_drop/displayimage-search-0-260-ArctGrndSqrl_

clearly, the address (SEF plugin) for that page should be http://example.com/stock/thumbnails-18-Whitetail-Deer-Photos.html

but something is also sending a link to the extended (and wrong) url above. And they're crawling a lot of pages with that same kind of url

example.com/stock/thumbnails-18-Whitetail-Deer-Photos.htmlhttp:/css/themes/water_drop/displayimage-search-0-5503-Ski-tracks-in-snow-Wrangell-St-Elias-Natio.html

etc, etc

The only bots I see crawling that stuff are from China, particularly a baidu.com. I'm adding them to my blocked IP addresses, but I'm curious if maybe I have some thing coded incorrectly that's causing the above urls to be crawled.

Thank you.

Cheers

Carl
« Last Edit: May 02, 2012, 11:41:57 am by Αndré »
Logged

Walkinman

  • Coppermine frequent poster
  • ***
  • Offline Offline
  • Gender: Male
  • Posts: 373
    • Skolai Images - Nature, Travel and Adventure stock photos
Re: weird url showing up
« Reply #1 on: April 21, 2012, 10:57:09 pm »

ETA: I also noticed that this weird url is ONLY showing up via one album:

http://www.skolaiimages.com/stock/thumbnails-18-Whitetail-Deer-Photos.html

It doesn't show up with any of the other albums.

I've blocked baidu from crawling my site, but am curious if anyone might have an idea what is generating that set of urls.

Thank you.

Cheers

Carl
Logged

Walkinman

  • Coppermine frequent poster
  • ***
  • Offline Offline
  • Gender: Male
  • Posts: 373
    • Skolai Images - Nature, Travel and Adventure stock photos
Re: weird url showing up
« Reply #2 on: April 21, 2012, 10:58:53 pm »

"cup usage" should read "cpu usage" of course .. it'd be nice if at least some editing of posts were allowed.

Thanks.
Logged

Walkinman

  • Coppermine frequent poster
  • ***
  • Offline Offline
  • Gender: Male
  • Posts: 373
    • Skolai Images - Nature, Travel and Adventure stock photos
Re: weird url showing up
« Reply #3 on: May 02, 2012, 06:20:42 am »

hello - is it possible for an admin to PLEASE edit the first post here, and change the domain name to example.com ... I'm getting hammered by google .. over 5500 'file not found' entries and rising.

What's weird is that once the url-Whitetail-Deer-Photos.htmlhttp:/css/themes/ starts, it then tries to crawl the entire coppermine-gallery with that kind of thing.

I shouldn't have typed the correct domain name in the post. Please edit or delete it.

Thank you.
Logged

Αndré

  • Administrator
  • Coppermine addict
  • *****
  • Country: de
  • Offline Offline
  • Gender: Male
  • Posts: 15765
Re: weird url showing up
« Reply #4 on: May 02, 2012, 11:42:54 am »

Edited as requested.
Logged

Walkinman

  • Coppermine frequent poster
  • ***
  • Offline Offline
  • Gender: Male
  • Posts: 373
    • Skolai Images - Nature, Travel and Adventure stock photos
Re: weird url showing up
« Reply #5 on: May 02, 2012, 07:43:22 pm »

Thanks so much, André. I shouldn't have been so stupid as to post it with the url.

What I don't understand is how a crawler accesses that one url, it then proceeds to try to crawl every link in the site with that string as the precedent. It'll put searches like "displayimage-search-0-260-ArctGrndSqrl_"and search every single keyword, and display a page for each one, with http://example.com/stock/thumbnails-18-Whitetail-Deer-Photos.htmlhttp as the first part of the string. All those pages will appear messed up, as the css doesn't apply correctly.

Thanks again for editing the post.

Cheers

Carl
Logged
Pages: [1]   Go Up
 

Page created in 0.024 seconds with 19 queries.