Sonic and Sega Retro Message Board: Accidentally found Korean magazine scans - Sonic and Sega Retro Message Board


Accidentally found Korean magazine scans

#1 User is offline Black Squirrel 

Posted 22 September 2018 - 01:01 PM

  • It's sometimes the real thing™
  • Posts: 4496
  • Joined: 27-December 03
  • Gender:Male
  • Location:Northumberland, England
  • Project:New Coke wasn't sold in the UK
  • Wiki edits:20,569
More things for Retro CDN:

https://www.gamemeca.../?mgz=gamechamp

Here are over 100 scans of "Game Champ" (게임챔프), a South Korean video game magazine from back in the day. We could do with this content being mirrored... though it's hidden behind an in-browser JavaScript reader, which makes it difficult to access by normal means.

I've got this far:
https://www.gamemeca...m=1992_12_1&p=1
(change the number at the end for different pages)


Can anyone see a clean way to download entire magazines? Google Translate isn't playing ball with this one, and I don't want to be writing scripts to download this stuff en masse if I can help it.


There's also another pressing issue, and you might have noticed it already - I don't understand this magazine at all. This is entirely undocumented in the Western world - there are issues called "Game Power" sprinkled in here - different magazine? Supplements? Name change? Some of these magazines also aren't dated - it's difficult to document things if we don't know what they are.


Now I know these requests rarely get off the ground, but here's some encouragement as to why we need to care:

Posted Image

Undumped Sonic 2 prototypes you say? But of course!

#2 User is offline Asagoth 

Posted 22 September 2018 - 02:01 PM

  • Behold ... the mighty... the flawless... salted cod eater...
  • Posts: 97
  • Joined: 16-January 17
  • Gender:Male
  • Location:Portugal
Well... it looks like a mix of three or four different magazines (the "Go Power" one seems to be a supplement)... I've downloaded a couple of things before that were also hidden behind an in-browser JavaScript reader... but only page by page... :(
This post has been edited by Overlord: 22 September 2018 - 02:14 PM
Reason for edit: Removed un-needed quote of previous post

#3 User is offline Overlord 

Posted 22 September 2018 - 02:13 PM

  • Substitute Meerkovo IT Chief
  • Posts: 17054
  • Joined: 12-January 03
  • Gender:Male
  • Location:Berkshire, England
  • Project:VGDB
  • Wiki edits:3,204
Yeah, I don't see these being pulled out without some sort of scripting. I might have a play tonight, see what I can whip up.

EDIT:

Quote

#! /usr/bin/env python
# -*- coding: utf-8 -*-

import urllib
import time
import os

issue = raw_input("Please enter the issue number: ")		#Takes the issue number
pages = raw_input("Please enter the number of pages: ")		#Takes the number of pages

current_directory = os.getcwd()	#Get the directory path...
os.mkdir(issue)					#Create a directory matching the issue.

fileToDL = urllib.URLopener()	#Create the file handler for the URLs to use

currPage = 1
while currPage <= int(pages):	#Download the magazine
  print "Downloading issue '" + issue + "', page #" + str(currPage) + " of " + pages + '...'
  fileToDL.retrieve("https://www.gamemeca.com/magazine/file.php?m=b&title=gamechamp&ym=" + issue + "&p=" + str(currPage), issue + "/" + str(currPage) + ".jpg")
  currPage += 1	#Set to the next page
  time.sleep(3)	#So we're not hammering the site


print "Done."	#Whee, finished this mag


The first prompt is the ym value, the second is the number of pages in the issue (you need to look manually on the issue's page on the website at the moment, I don't know how to detect if a page doesn't actually exist. I'm not that great a coder). I'm successfully ripping the 2000_12 mag. This works for Python 2.6, anyone else willing to help rip these? =P
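
On the "how do I detect a page that doesn't exist" question, here is a rough Python 3 sketch of one way it might be done. It assumes (untested against the real site) that a request past the last page either fails outright or returns something that isn't a JPEG. The names `fetch_page_http`, `looks_like_jpeg` and `count_pages` are made up for this example, as is the `max_pages` safety cap.

```python
import time
import urllib.request

BASE_URL = "https://www.gamemeca.com/magazine/file.php?m=b&title=gamechamp"
JPEG_MAGIC = b"\xff\xd8"  # every JPEG file starts with these two bytes


def fetch_page_http(issue, page):
    """Return the raw bytes for one page, or None if the request fails."""
    url = "{}&ym={}&p={}".format(BASE_URL, issue, page)
    try:
        with urllib.request.urlopen(url, timeout=30) as response:
            return response.read()
    except OSError:  # covers URLError, HTTPError and socket timeouts
        return None


def looks_like_jpeg(data):
    """True if the payload is non-empty and starts with the JPEG marker."""
    return bool(data) and data.startswith(JPEG_MAGIC)


def count_pages(issue, fetch_page=fetch_page_http, max_pages=500, delay=3):
    """Probe pages 1, 2, 3... and return how many came back as JPEGs.

    ASSUMPTION: a missing page shows up as an error or a non-JPEG body.
    If the site serves a placeholder JPEG instead, this will not stop early.
    """
    for page in range(1, max_pages + 1):
        if not looks_like_jpeg(fetch_page(issue, page)):
            return page - 1
        time.sleep(delay)  # same courtesy pause as the script above
    return max_pages
```

Calling `count_pages("2000_12")` would then give a page count to feed into the second prompt, provided the assumption about missing pages holds.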

#4 User is offline Asagoth 

Posted 22 September 2018 - 03:32 PM

  • Behold ... the mighty... the flawless... salted cod eater...
  • Posts: 97
  • Joined: 16-January 17
  • Gender:Male
  • Location:Portugal
I would help... the problem is that I don't even know how to use that... I know this was a totally unnecessary comment... my apologies for it...

#5 User is offline Black Squirrel 

Posted 23 September 2018 - 04:43 AM

  • It's sometimes the real thing™
  • Posts: 4496
  • Joined: 27-December 03
  • Gender:Male
  • Location:Northumberland, England
  • Project:New Coke wasn't sold in the UK
  • Wiki edits:20,569
https://retrocdn.net..._Supplement.pdf

Ding!

Python 3.x fans will need to replace the "raw_input()" call with "input()" (and maybe other things) - I was impressed to find that a Python 2.x interpreter was already on this box, so that script worked a treat.
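
For reference, the "other things" probably come down to three changes: `raw_input()` becomes `input()`, `print` becomes a function, and `urllib.URLopener` is replaced by `urllib.request.urlretrieve`. A minimal Python 3 sketch of the same script follows. The function names and the `delay`/`retrieve` parameters are additions for this example, not part of Overlord's original.

```python
import os
import time
import urllib.request

BASE_URL = "https://www.gamemeca.com/magazine/file.php?m=b&title=gamechamp"


def page_url(issue, page):
    """Build the same URL the original script requests for one page."""
    return "{}&ym={}&p={}".format(BASE_URL, issue, page)


def download_issue(issue, pages, delay=3, retrieve=urllib.request.urlretrieve):
    """Save pages 1..pages of an issue as <issue>/<page>.jpg."""
    os.makedirs(issue, exist_ok=True)  # unlike os.mkdir, tolerates a re-run
    for page in range(1, pages + 1):
        print("Downloading issue '{}', page #{} of {}...".format(issue, page, pages))
        retrieve(page_url(issue, page), os.path.join(issue, "{}.jpg".format(page)))
        time.sleep(delay)  # so we're not hammering the site
    print("Done.")


def main():
    """Same two prompts as the Python 2 version."""
    issue = input("Please enter the issue number: ")
    pages = int(input("Please enter the number of pages: "))
    download_issue(issue, pages)
```

Run `main()` to get the original interactive behaviour, or call `download_issue()` directly from another script.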


This does take a while though (even if you were to knock down the sleep time), so I'd very much like to spread the load as much as possible.

#6 User is offline Overlord 

Posted 23 September 2018 - 01:02 PM

  • Substitute Meerkovo IT Chief
  • Posts: 17054
  • Joined: 12-January 03
  • Gender:Male
  • Location:Berkshire, England
  • Project:VGDB
  • Wiki edits:3,204
Well I've got most of 2000 now so I guess I'll just continue working backwards, if you start from the end and work forwards.

#7 User is offline You-Are-Pwned 

Posted 24 September 2018 - 01:04 PM

  • Posts: 70
  • Joined: 28-August 10
  • Gender:Male
I have downloaded all of the GameChamp scans, but the archive size is about 9.5 gigabytes. Should I upload it somewhere?

#8 User is offline Black Squirrel 

Posted 24 September 2018 - 01:07 PM

  • It's sometimes the real thing™
  • Posts: 4496
  • Joined: 27-December 03
  • Gender:Male
  • Location:Northumberland, England
  • Project:New Coke wasn't sold in the UK
  • Wiki edits:20,569
A few issues in and I'm finding that the page numbers are sometimes a bit screwy. I'm uploading as-is because it's difficult to make the call on where pages should be (and the chances of finding replacement scans are slim right now), but it's something to bear in mind.

Also while I don't know if these are just translated Japanese magazines, there is a lot of potentially useful information in there. I've found all sorts and I'm not past March 1993.

View PostYou-Are-Pwned, on 24 September 2018 - 01:04 PM, said:

I have downloaded all of the GameChamp scans, but the archive size is about 9.5 gigabytes. Should I upload it somewhere?

eek

follow my lead:

download this
https://retrocdn.net/File:JPEGtoPDF.7z

make PDFs

upload to
https://retrocdn.net...ame_Champ_scans
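
One thing to watch when making PDFs: the download script saves pages as 1.jpg, 2.jpg, ... 10.jpg, and a plain alphabetical sort puts 10.jpg before 2.jpg. Whether JPEGtoPDF handles that itself isn't clear from this thread, so here is a small sketch for getting the files into page order first. The function names are invented for this example.

```python
import re


def page_number(filename):
    """Return the page number from a name like '12.jpg', or None otherwise."""
    match = re.match(r"(\d+)\.jpg$", filename, re.IGNORECASE)
    return int(match.group(1)) if match else None


def pages_in_order(filenames):
    """Sort page files numerically, ignoring anything that isn't <number>.jpg."""
    numbered = [(page_number(name), name) for name in filenames]
    return [name for number, name in sorted(p for p in numbered if p[0] is not None)]
```

Feeding `os.listdir(issue_folder)` through `pages_in_order()` gives a list that can be handed to whichever PDF tool is in use.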

#9 User is offline Overlord 

Posted 24 September 2018 - 01:59 PM

  • Substitute Meerkovo IT Chief
  • Posts: 17054
  • Joined: 12-January 03
  • Gender:Male
  • Location:Berkshire, England
  • Project:VGDB
  • Wiki edits:3,204
I'm down to 1999_8_1 now - I also did an update to the script, btw:

Quote

#! /usr/bin/env python
# -*- coding: utf-8 -*-

import urllib
import time
import os

issue = raw_input("\nPlease enter the issue number: ")		#Takes the issue number
pages = raw_input("Please enter the number of pages: ")		#Takes the number of pages

print "" 						#A newline, just to make spacing a bit neater.
startTime = time.strftime('%X')	#Start time for use when we finish up.

current_directory = os.getcwd()	#Get the directory path...
os.mkdir(issue)					#Create a directory matching the issue.

fileToDL = urllib.URLopener()	#Create the file handler for the URLs to use

currPage = 1
while currPage <= int(pages):	#Download the magazine
  print "[" + time.strftime('%X') + "] Downloading issue '" + issue + "', page #" + str(currPage) + " of " + pages + '...'
  fileToDL.retrieve("https://www.gamemeca.com/magazine/file.php?m=b&title=gamechamp&ym=" + issue + "&p=" + str(currPage), issue + "/" + str(currPage) + ".jpg")
  currPage += 1	#Set to the next page
  time.sleep(3)	#So we're not hammering the site


print "\nDownload of issue '" + issue + "' completed at " + time.strftime('%X') + " (started at " + startTime + ")."	#Finished getting this mag.

Nothing new in how the files are downloaded, just a few start/finish timestamps and some formatting changes to make it look a bit prettier. No real urgent need to replace the old one.

EDIT: Welp, hadn't seen the above. Guess I should stop then =P
This post has been edited by Overlord: 24 September 2018 - 02:03 PM

#10 User is offline You-Are-Pwned 

Posted 24 September 2018 - 02:20 PM

  • Posts: 70
  • Joined: 28-August 10
  • Gender:Male
I'm trying to batch upload and having some errors like these. Not sure why:

Posted Image

#11 User is offline Black Squirrel 

Posted 24 September 2018 - 03:59 PM

  • It's sometimes the real thing™
  • Posts: 4496
  • Joined: 27-December 03
  • Gender:Male
  • Location:Northumberland, England
  • Project:New Coke wasn't sold in the UK
  • Wiki edits:20,569
MediaWiki will stop people from uploading the same file more than once - if you've followed exactly the same steps I did, you're probably clashing with the magazines I've already uploaded.


p.s., probably want to keep the same naming scheme (e.g. "GameChamp KR 1993-03.pdf") - things will get lost if there are just numbers in the file name (remember: there are 6000+ files on Retro CDN).
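
A quick sketch of how that naming scheme could be generated from the `ym` values used by the download script. It ASSUMES the first two underscore-separated fields of `ym` are year and month (e.g. "1993_3"); what a trailing field such as the "_1" in "1992_12_1" means is not known here, so it is simply ignored and two such issues could end up with the same name. Both function names are hypothetical.

```python
def parse_ym(ym):
    """Split a ym value like '1993_3' into (year, month).

    ASSUMPTION: fields one and two are year and month; anything after
    that (e.g. the '_1' in '1992_12_1') is dropped.
    """
    parts = ym.split("_")
    return int(parts[0]), int(parts[1])


def cdn_filename(year, month, title="GameChamp", region="KR"):
    """Build a name in the 'GameChamp KR 1993-03.pdf' style."""
    if not 1 <= month <= 12:
        raise ValueError("month must be 1-12, got {}".format(month))
    return "{} {} {:04d}-{:02d}.pdf".format(title, region, year, month)
```

For example, `cdn_filename(*parse_ym("1993_3"))` produces the file name quoted above.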

#12 User is offline biggestsonicfan 

Posted 24 September 2018 - 04:51 PM

  • Model2wannaB
  • Posts: 774
  • Joined: 09-May 07
  • Gender:Male
  • Project:Formerly Sonic the Fighters

View PostBlack Squirrel, on 24 September 2018 - 03:59 PM, said:

p.s., probably want to keep the same naming scheme

The biggest of the Retro wiki's flaws imho =/

#13 User is offline Black Squirrel 

Posted 25 September 2018 - 11:42 AM

  • It's sometimes the real thing™
  • Posts: 4496
  • Joined: 27-December 03
  • Gender:Male
  • Location:Northumberland, England
  • Project:New Coke wasn't sold in the UK
  • Wiki edits:20,569

View Postbiggestsonicfan, on 24 September 2018 - 04:51 PM, said:

View PostBlack Squirrel, on 24 September 2018 - 03:59 PM, said:

p.s., probably want to keep the same naming scheme

The biggest of the Retro wiki's flaws imho =/

Descriptive and consistent file names are a strange definition of "flaw".

If this set is missing a magazine, it'll be tough to notice if every file is called random things.
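
To illustrate the point, with consistent names a gap check is only a few lines. This sketch assumes the "GameChamp KR YYYY-MM.pdf" pattern and, hypothetically, one issue per month; the function name and the date range are placeholders, since the magazine's actual run isn't established in this thread.

```python
import re

NAME_PATTERN = re.compile(r"^GameChamp KR (\d{4})-(\d{2})\.pdf$")


def missing_months(filenames, first, last):
    """List (year, month) pairs between first and last (inclusive) with no file.

    ASSUMPTION: one issue per month. Names that don't match the pattern
    are ignored, which is exactly why random names would hide gaps.
    """
    present = set()
    for name in filenames:
        match = NAME_PATTERN.match(name)
        if match:
            present.add((int(match.group(1)), int(match.group(2))))

    missing = []
    year, month = first
    while (year, month) <= last:
        if (year, month) not in present:
            missing.append((year, month))
        month += 1
        if month > 12:  # roll over into the next year
            year, month = year + 1, 1
    return missing
```

Given a folder listing and a start and end month, the function returns the months that have no matching file.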

#14 User is offline Overlord 

Posted 25 September 2018 - 01:26 PM

  • Substitute Meerkovo IT Chief
  • Posts: 17054
  • Joined: 12-January 03
  • Gender:Male
  • Location:Berkshire, England
  • Project:VGDB
  • Wiki edits:3,204
Yeah, I agree with Black Squirrel - that's not a flaw, if anything it's a strength.

#15 User is offline You-Are-Pwned 

Posted 02 October 2018 - 03:24 AM

  • Posts: 70
  • Joined: 28-August 10
  • Gender:Male
I uploaded all PDFs here in case anyone wants to take over the uploading work for me:

https://cloud.mail.r.../JfEP/JFidYKM2h
https://cloud.mail.r.../KRR8/YQ9GQEA37
https://cloud.mail.r.../EP54/wr9PCbjyV
https://cloud.mail.r.../MBPo/kHGPezjAj
https://cloud.mail.r.../7tit/Uct6tf1Ko
https://cloud.mail.r.../Hv3T/QeE49MLyD
