Web Content Mining for Pokenet

DerNalia

New Member
Joined
Feb 25, 2010
Messages
6
Reaction score
0
So, Bulbapedia is amazingly thorough..
I'm interested if there is a way to mine the Bulbapedia to use for the free, opensource game, pokenet.
 
For our game, pokemon are read in from the file in the following format:

[1]
Name=BULBASAUR
InternalName=BULBASAUR
Kind=SEED
Pokedex=BULBASAUR can be seen napping in bright sunlight. There is a seed on its back. By soaking up the sun's rays, the seed grows progressively larger.
Type1=GRASS
Type2=POISON
BaseStats=45,49,49,45,65,65
Rareness=45
BaseEXP=64
Happiness=70
GrowthRate=Parabolic
StepsToHatch=5120
Color=Green
Habitat=Grassland
EffortPoints=0,0,0,0,1,0
Abilities=OVERGROW
Compatibility=1,7
Height=0.7
Weight=6.9
GenderRate=FemaleOneEighth
Moves=1,TACKLE,4,GROWL,7,LEECH SEED,10,VINE WHIP,15,POISONPOWDER,15,SLEEP POWDER,20,RAZOR LEAF,25,SWEET SCENT,32,GROWTH,39,SYNTHESIS,46,SOLARBEAM
EggMoves=LIGHT SCREEN,SKULL BASH,SAFEGUARD,CHARM,PETAL DANCE,MAGICAL LEAF,GRASSWHISTLE,CURSE
Evolutions=IVYSAUR,Level,16
BattlerPlayerY=16
BattlerEnemyY=14
BattlerAltitude=0


So. I was thinking... "man.. it would absolutely suck to create / edit this huge list" ... and then it occurred to me... why not just have a program update the list from bulbapedia?! Bulbapedia has all this information.. no need to find it ourselves.

I've never done content mining before, but it would make the devs over at Pokenet really happy. =)
 
So long as the game fits in with the CC BY-NC-SA license, taking the info is totally fine. Having it constantly updating however might be an issue.
The main reason the infamous iPhone Pokédex app was such a huge problem was partly because it took BP's information without credit for commercial use, but but also because, by accessing BP's information directly, it was using our servers, causing us unnecessary lag and server strain while entirely skipping the ads that keep our server up and running.

I don't know enough about our servers to say anything more on the issue, maybe if it only updated occasionally it would be totally fine. Depends on how you plan to do it, and where the data ends up being stored for the game. But I don't know.
 
We wouldn't update that often. We'd probably only update if someone claims a pokemon can't learn a move that its supposed to or something like that..

As for licensing, we have no problem crediting you guys. We are also open source.

The data is all going to be in an ini file on the server machines. ^_^
 
While I'm uncomfortable with a lot of the fan made Pokémon games in general (especially where they're a type of game that's essentially in direct competition with existing Pokémon franchise games), so long as you abide by the terms of our usage license, we can't exactly say no.
 
we will give credit, no need to worry -- and open source isn't about money. it's about the community. ^_^

Question though..
I know this is a really crazy request... considering you guys don't know or trust me... but if all the pokemon information is stored in a database... I was wondering if I could have read only access to the database... for the sole purpose of creating templates like the one in one of my previous posts =/

That way I could easily write a program to pull data and sort it as needed.
 
I know.. but. twould be a handy option... I'm still trying to figure out how to make a spider for scraping web pages
 
Alright, that's fair.
I'm starting to get the hang on this scraping anyway =p

Thanks for the help guys!
 
Please note: The thread is from 16 years ago.
Please take the age of this thread into consideration in writing your reply. Depending on what exactly you wanted to say, you may want to consider if it would be better to post a new thread instead.
Back
Top Bottom