Home  |  Forums  |  914 Info  |  Blogs
 
914World.com - The fastest growing online 914 community!
 
Porsche, and the Porsche crest are registered trademarks of Dr. Ing. h.c. F. Porsche AG. This site is not affiliated with Porsche in any way.
Its only purpose is to provide an online forum for car enthusiasts. All other trademarks are property of their respective owners.
 

Welcome Guest ( Log In | Register )

> SOT: Programmers and Database Builders, Last bump.... Bummer
McMark
post May 7 2015, 10:45 AM
Post #1


914 Freak!
***************

Group: Retired Admin
Posts: 20,180
Joined: 13-March 03
From: Grand Rapids, MI
Member No.: 419
Region Association: None



I'd like to transcribe the PET file into a usable database. I've tried a few times to undertake this project myself, but it's daunting. So I realized that we could set up a site where our members could add a little bit of the data at a time. With everyone's help, it'll be done in no time. Having this data available will enable future developments, such as adding real pictures of the parts, better how-to threads, linking to part numbers in posts, etc. Andy and I have both planned on setting this up, but neither of us has actually found the time to get started. Since this doesn't really need to be tied into the 914World forum in any way, we don't need to build it on the forum servers. We can set this up independently and then import the completed database file when we're done...

Anyone interested in helping with this project? Here's a bit of overview on what I had planned:

***Split the PET file into JPG/GIF files***
I planned on splitting the file up into usable image files, which could also be stored in the database (it's own table?). The tricky part, is that besides splitting the PDF by page numbers, we also have to split SOME of the pages in half.

***Phase 1 Data Entry***
This one is more simple, just build a page that will display one of the PET images (not the exploded diagrams, just the parts list) and display a HTML form that matches the formatting, so a user could log in, and transcribe line by line as much of the image as they felt like. The form should save the data automatically (AJAX) so the user doesn't have to complete a page, or remember to click save, etc. This means that when a user 'starts work' they could be presented with a partially complete image to add data to. In that case, it would also be useful to add a checkbox at the end of each line used to indicate that the previous work has been double-checked and is correct. Once all of the data is entered, new requests for 'work' would be presented with completed images for double-checking. Once a line has been triple-checked, it could be locked as accurate. Eventually we would have all the data transferred and triple checked.

***Phase 2 Real World Descriptions***
Since a lot of the listing in the PET are translated from German incorrectly, it would be worthwhile to go through all the listings again to translate them. This would be a slightly different process from above. We would display an exploded diagram and the details for that image from the database, not from the PET images. The only form field would be an [i]additional[i] field for a new description. I think it would be useful to maintain and original listing of the description from the PET, as well as our own description. It would also be useful to collect multiple descriptions, which may not be shown publicly, but would be useful for searching for parts. For something like the 'Taco plate', it's listed in the PET as 'cover for oil sump' but everyone knows it as a taco plate. But it could also be called an oil temp sender plate. All of these descriptions would be useful for searching.

***Phase 3 Further Expansion***
This phase is probably where the project would end and the data integrated into the forum software, and future development handled by Andy or myself. But in order to describe the full process, I've included it here. This phase would be where members could add pictures of the parts (alone or on the car), as well as things like original finishes (paint, plating, etc), manufacture materials, possible replacements (using 911 Sport Mounts instead of Transmission Mounts).


Attached thumbnail(s)
Attached Image
User is offlineProfile CardPM
Go to the top of the page
+Quote Post
 
Reply to this topicStart new topic
Replies
Mike Bellis
post May 8 2015, 10:51 PM
Post #2


Resident Electrician
*****

Group: Members
Posts: 8,347
Joined: 22-June 09
From: Midlothian TX
Member No.: 10,496
Region Association: None



I'm unlocking it and running text recognition as we speak...

I mean, no that's not what I'm doing... (IMG:style_emoticons/default/biggrin.gif)

My dumputer is running sloooww right now...
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

Posts in this topic
McMark   SOT: Programmers and Database Builders   May 7 2015, 10:45 AM
SirAndy   - Is this in a PDF? - If so, is the text on the ri...   May 7 2015, 10:55 AM
BeatNavy   This is a very cool idea. - Is this in a PDF? - I...   May 7 2015, 11:42 AM
stevegm   - Is this in a PDF? - If so, is the text on the r...   May 7 2015, 01:14 PM
type47   Have you seen the Parts Vault on this site (sub ca...   May 7 2015, 11:02 AM
7TPorsh   Maybe set it up like a Wikipedia site. Dump all th...   May 7 2015, 11:17 AM
gms   I put all the parts numbers and descriptions in a ...   May 7 2015, 11:35 AM
McMark   Here's the extracted text. The problem is tha...   May 7 2015, 12:46 PM
Andyrew   Jpegs can be converted to PDF pretty easily... I...   May 7 2015, 01:29 PM
McMark   How many pages is this PET file? 330, but accura...   May 7 2015, 01:54 PM
bandjoey   I think it's a great idea but as usual with P-...   May 7 2015, 02:00 PM
SixerJ   Really cool idea, a while ago I transcribed the 91...   May 7 2015, 02:33 PM
McMark   :bump: Anyone want to take lead on this? I was h...   May 8 2015, 10:27 AM
McMark   Okay, last :bump: I thought we would get some he...   May 8 2015, 09:49 PM
Mike Bellis   I'm unlocking it and running text recognition ...   May 8 2015, 10:51 PM
Mike Bellis   Sure is taking a long time... :( I'll attach ...   May 8 2015, 11:24 PM
Mike Bellis   Here it is, in all it's glory. Unlocked and w...   May 9 2015, 09:05 AM
McMark   Here it is, in all it's glory. Unlocked and ...   May 9 2015, 11:17 AM
Mike Bellis   The PET already was searchable... :confused: plu...   May 9 2015, 03:49 PM
Mike Bellis   Now I'm converting it to a word doc to see wha...   May 9 2015, 09:31 AM
altitude411   :cheer: :cheer: :cheer: Operation " black...   May 9 2015, 09:38 AM
Kansas 914   Mike - well done! The search works like a char...   May 9 2015, 09:59 AM
ConeDodger   Mark, I have an original Porsche hard copy if you...   May 9 2015, 09:59 AM
Mike Bellis   A word doc version is on the link. I am now trying...   May 9 2015, 10:07 AM
Mike Bellis   the excel version is taking longer. I will update ...   May 9 2015, 10:13 AM
Mike Bellis   I added a text version and a very basic excel vers...   May 9 2015, 04:33 PM
McMark   That looks a lot like what I posted a few days ago...   May 9 2015, 04:59 PM
Mike Bellis   Well, here's how it works. Take the text file,...   May 9 2015, 06:23 PM
nsyr   I have a spider program that can put pdfs into a d...   May 10 2015, 02:04 AM


Reply to this topicStart new topic
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:

 



- Lo-Fi Version Time is now: 1st July 2025 - 06:01 AM