[Build 606] Scraper problem or UMM bug?

Post any feedback, problems and bugs from the Pre-Alpha releases here.

Moderator: Spudsdude

[Build 606] Scraper problem or UMM bug?

Postby gotest » Fri Mar 12, 2010 5:28 pm

Thanks for the software. Expected for a long time and gave it a try.

OS: Win 7 - Simplified Chinese

IMDB scraper works well with English folder name. Obviously it is not able to deal with Chinese name.

There are 2 Chinese scarpers in the package downloaded from UMM site. The web site imdb.cn was gone, while Mtime scraper can scrape nothing (in both server/client and standalone client modes). I got another Chinese scraper - 7176. The new scraper was copied to scrapers\video\, and did not appear in the configeditor but did work with movie folders in English name in standalone client mode. However, the all scraped information was displayed as messy code. I went to the movie folder and opened movie.nfo file with notepad, and also found the Chinese characters in messy code.

If put some Chinese characters in the name of the movie folder, even 7176 scraper does not work at all.

I am not sure if it is the scraper problem or the UMM's bug.

Additionlly, in server/client mode, if press "show scarper detail", an error message will pop up. I pasted it here:

http://umm.pastebin.com/djpvbGhS

Another:

If the actor display area is blank (no actor listed) and click on the blank area, it will give an error message:

http://umm.pastebin.com/ZqpjaPb3
gotest
New Member
New Member
 
Posts: 3
Joined: Thu Mar 11, 2010 5:22 am

Re: [Build 606] Scraper problem or UMM bug?

Postby Spudsdude » Sat Mar 13, 2010 10:42 pm

Can you pastebin the debug log file as well as the folder names that you used with the 7176 scraper.

Thanks for the info on the two bugs, i'll get those sorted out.

The imdb cn is located in the svn at
umm\3rdParty\srxml\sxml\trunk\branches\cSharp\TechNuts\ScraperXML Test Program CSharp\scrapers\video

it should also be included in all releases as the builder just copys all of the scrapers from there into the builder
User avatar
Spudsdude
Team UMX Developer
Team UMX Developer
 
Posts: 466
Joined: Fri Sep 04, 2009 9:06 pm

Re: [Build 606] Scraper problem or UMM bug?

Postby Spudsdude » Sun Mar 14, 2010 4:02 am

the two crash bugs should be fixed in 610, which is up for download..

there has been a few new items in the config and I think the database as well, so you'll want to remove the database's and config.xml file
User avatar
Spudsdude
Team UMX Developer
Team UMX Developer
 
Posts: 466
Joined: Fri Sep 04, 2009 9:06 pm

Re: [Build 606] Scraper problem or UMM bug?

Postby gotest » Wed Mar 17, 2010 3:40 pm

Sorry didn't follow the thread promtly.

Actually, the website imdb.cn did not exist any more, so the IMDB.cn scraper is useless now. (I was told imdb.cn had been moved to http://www.7176.com.)


Version 612

Folders:

C:\umm_win32_1.0.0.612\movie\The Matrix
C:\umm_win32_1.0.0.612\movie\风声
C:\umm_win32_1.0.0.612\movie\Feng sheng

Feng sheng is the pinyin of 风声. They are the same film.

The Matrix w/ Mtime scraper: nothing scraped
http://umm.pastebin.com/DneAHCxM

风声 w/ Mtime scarper: Nothing
http://umm.pastebin.com/N8jx3Hwu

The Matrix w/ 7176 scraper: succeeded but messy code
http://umm.pastebin.com/Lc2F5vcn

风声 w/ 7176 scarper: nothing
http://umm.pastebin.com/NtnYKtCU

Feng sheng w/ 7176 scraper: succeeded but messy code
http://umm.pastebin.com/tp7xXx8k

Attach 7176 scraper for your reference.
Scrapers_7176_20091227.rar
7176 scraper
(16.8 KiB) Downloaded 17 times


PS: how to add my own scraper to the server end?
gotest
New Member
New Member
 
Posts: 3
Joined: Thu Mar 11, 2010 5:22 am

Re: [Build 606] Scraper problem or UMM bug?

Postby Spudsdude » Wed Mar 17, 2010 5:11 pm

found a few references to imdb.com in the 7176 scraper, I'm guessing those need to be fixed

I do see the mess of data that's in a scrape for Feng sheng (using 7176), i'm guessing that's a character conversion issue or something along those lines.

Does the 7176 scraper and mtime scraper work in xbmc ?
User avatar
Spudsdude
Team UMX Developer
Team UMX Developer
 
Posts: 466
Joined: Fri Sep 04, 2009 9:06 pm

Re: [Build 606] Scraper problem or UMM bug?

Postby gotest » Sat Mar 20, 2010 5:30 pm

Since I am not home, I cannot test Mtime scraper in XBMC - I knew the 7176 scraper does work with XBMC.

I replaced the Mtime scraper in UMM with a new version and found the new version of Mtime scraper worked like a charm - even can deal with Chinese folder name.

Further, I discovered http://www.mtime.com is encoded in UTF-8 and http://www.7176.com GB2312. I guess UMM may not handle GB2312 properly.

Mtime scraper with 风声:
http://umm.pastebin.com/VhAHcq0a


mtime.xml
(7.85 KiB) Downloaded 16 times


PS:the reference to imdb.com in the 7176 scraper is for the purpose of downloading the poster from imdb.
gotest
New Member
New Member
 
Posts: 3
Joined: Thu Mar 11, 2010 5:22 am

Re: [Build 606] Scraper problem or UMM bug?

Postby Spudsdude » Mon Mar 22, 2010 8:00 pm

Thanks for the mtime information and updated scraper, i've added it to the svn (rev 613)

Also, i'm sure that it's the GB2312 encoding that the scraper doesn't like, I'll see what we can dig up on it.
User avatar
Spudsdude
Team UMX Developer
Team UMX Developer
 
Posts: 466
Joined: Fri Sep 04, 2009 9:06 pm


Return to Pre-Alpha

Who is online

Users browsing this forum: No registered users and 1 guest

cron