Archiving data from old floppy disks can be a tedious process at best. Poorly labeled disks combined with slow transfer speeds put it high on the list of things we would rather not do, and it turns out that [Dweller] was of the same opinion. With an estimated 5,000 floppies in his collection, he finally decided it was time to clean house.
With no idea of what was stored where, he decided the best way to go about the process was to read all of the disks, archiving everything, saving the sorting process for later. He originally started by building a floppy autoloader out of Lego Mindstorm parts, which looked good on paper, but performed pretty poorly.
He came across an old floppy duplicator on eBay and figured that since the machine was built for handling gobs of disks, that it was the perfect base for his autoloader. He pulled the mechanical bits from the machine, incorporating them into the rig you see above. He swapped out the duplicator’s brains for an Arduino, which allows him to batch copy his disks and save a picture of each label with little effort.
He says that the system works great, making his life a lot easier (and less cluttered!)
Check out the video below to see his floppy autoloader in action.
[youtube-=https://www.youtube.com/watch?v=H5lkxSY7QsI&w=470]
I would probably recommend using an Harddrive or CF-Card with WHDLoad, but wheres the fun in that :D
whdload is all well & good for the games that support it, and the Amigas I have left do have hard drives.. and their data is already archived.. it was the rather large pile of floppies that were the problem needing solving.. Some of them hold stuff I wrote back then, some have tracker tunes, midi files, data files etc I don’t want to lose, of course, now they are lost in a large dir full of data, but that’s easier to sort through =)
Cool. I’d suggest putting a bin on the exit so you don’t have to pick up all the disks.
Heh.. it was quite fun watching the pile of disks build up on the floor.. and I assume you mean bin in the sense of ‘hopper’ rather than bin in the sense of ‘rubbish’ ;p
I understand it as rubbish because you want to do that one last time and then put everything up on the cloud. ⚡😆⚡
I also had Amigas and I turned my back on a whole bunch of stuff I created.
The swan song for floppy.
We actually still use floppy disks for interfacing with our ABB robots.
Jesus man! Get the adapter kit!
Why has your company not spent a dime on your ABB robots? Get the upgrade kit that has been available for a decade.
This guy is a fucking genius!
Could this not be sped up by loading and archiving a floppy *whilst* the previous one is being photographed, not after?
Yes.. if you don’t use an Intel atom n240 to coordinate it all.. it struggled to stream the disk data while transferring the jpg
Only that now the clutter he has is of 5000 floppies, an autoloader and a huge task waiting in the hard disk, because, could he really throw away the floppies? :)
Lol, it’s like you are reading my mind.. but I had to do this project partly to rescue the data before it became irretrievably lost.. and partly because the space where I was storing the disks was required to hold other stuff.. No doubt I will be receiving strong ‘encouragement’ to lose both the disks and the autoloader before the year is out.. Ideally I’d find someone with a similar problem & pass it on.
I wouldn’t throw away the floppy’s but store them somewhere out of sight. even if they get damaged in storage you now have a backup right?
What an operation! Very cool setup.
Looks great I did mine old school about 10 years ago by hand. took forever. Still didn’t get rid of all the floppies, just for safety sake if the DVD the images are on died.
Heads up but the dyes in burned DVDs do deteriorate to the best of my knowledge. I have a few I burned way back when that can no longer be read. no sunlight or scratches either.
Very nice… too bad there isn’t another purpose for this machine once your done archiving your floppies because it seems work so good ! maybe the code and idea can be used to build a CD/DVD autoloader for batch ripping ?
“A certain” thrust bearing manufacturer uses a process very similar to this to load loose bearings into a machine. Probably in use elsewhere, but there at least it was known as a “stack feeder”.
I wish I had one of those when I was formatting AOL disks by the hundreds
Awesome build :)
I wish I had had one of these in 1999 when I had to leave all my floppies behind to emigrate.
… And a hard disk at current price per gigabyte of course.
===Jac
I wish I had one of these in 1999 before I emigrated and had to leave my floppies behind.
And a hard disk at current price-per-gigabyte of course.
===Jac
Wow, I forgot how slow floppies were. Even at 6x playback, that thing is really slow.
I feel like it takes an inordinately long amount of time to snap a picture. Can’t you trigger it with an optical break and the arduino to snap the picture as the disk slides past, without the solenoid/delay?
Not that simple due to the frame rate on my digicam. Maybe if I could afford better equipments then I could speed things up.
Hmm.. above wasn’t me… delay is due to xfer time.. not frame rate..
I don’t know if he is, but what I’d be doing is auto detecting the top edge of the disk, so that the resulting image is oriented properly.
If the disks are all loaded in the same direction… there’s not much need to orient the image. Just adjust the camera before hand and every label should be in the same place. I think? I’d simply do the adjustment in a batch in Photoshop or something as post-process.
Anyone else think he should now reverse the loader and make it into a floppy disk auto-turret?
yes, all the disks are loaded in the same direction, so all the jpgs have the same shutter orientation, but there was a total lack of agreement as to which way up you labelled a 3.5″ floppy, so I have to rotate the images in the archive management app I’ve created .. which is using CRCs of the disk image, against the Tosec dat files, to auto identify as many as possible, and letting me go in to fill in the rest.. many evenings ahead of me to add metadata for all the other images.. I’m hoping to publish a dat of my own at the end, for any disks that others might have.
To accomplish that you could spend 10 hours writing and testing the code to do that or you can spend 10 minutes adding a second solenoid in parallel and making sure the camera is properly aligned. Personally I like the 10 minute solution.
And most of the time spent taking the photo, is actually the time taken to transfer it back from the camera over USB, which takes a while, but at least it means I get a decent resolution image, autofocused etc, rather than the 320×240 webcam shot I started out with.
You could do the real hacker’s 10 minute solution: Digital camera with its own storage, solder relay to shutter trigger switch. Disk fall sthrough, Arduino brain senses light break photocell, triggers shutter via GPIO/relay.
Download images later and associate with same files by number/order, or write a little util to do it for you. Adjust the disk speed with the angle of the ejection ramp, so there wouldn’t need to be too much trial-and-error with the timing.
I also wouldn’t mind seeing these floppies turn into projectiles via the two-spinning-wheels type launcher. Maybe hack up a tennis serve/pitching machine to shoot them?
It’s a bit slow, because it’s not just reading the data, its storing the raw mfm track timings, and attempting to validate each track as amiga formatted (since thats the bulk of the data) and when it fails a validate, it re-reads the track a few times to see if multiple reads helps it get a valid checksum. So for some raw tracks it stores a bunch load of revolutions worth, to give more data to reconstruct the track from later.. that’s all down to the kryoflux, which as a data preservation bit of kit does an awesome job.. better than the mfm flux encoder I’m building on the STM32 olimexino..
I was wondering about read errors. One of my great enjoyments of leaving floppies behind besides the size issue was the “floppy not formatted” error I got half the time from bad floppies or misaligned drives.
Also not sure why he can’t load a new disk while the ejected one is being imaged?
Problem is latency in the Arduino I2C bus. There just isn’t enough slack.
The sobering part for those of us who remeber floppies is that depending on how he’s got the camera setup, the photo of each label is probably larger than contents of the actual disk itself.
Bigger than the adf yes, the raw track data beats it tho .. its only an 8mpixel camera
Why do images of a floppy-launching smart gun fill my mind?
Sadly the disks are only pushed out by the eject spring inside the drive, it’s one reason I ended up mounting the entire thing 45′ to help get out problem disks.
Gotta wonder if spinning the magnetic part of the disk one direction, while giving the casing opposite rotation might help stabilise the flight tho.. but I doubt the center would spin by itself long enough after the motor disconnects, to help much..
thats what i was thinking!
(PS: cool autoloader, a true hack :))
write the data/coordinates to a file on disk and load into disk hopper
have a turret/autogun that uses the data on the disk (auto-cheating) to shoot it in the right instant and direction ect
like skeet shooting, execpt its all automatic and 100% ACCURATE hehehe
Nice build – no doubt about that.
But now he has 10G of non-sorted stuff he hasn’t needed, hasn’t looked at for who knows how long, stuffed on a hard drive someplace he’ll never need, will never look at, etc.
Maybe he’ll be on the hoarders TV show next.
It’s over 50gb of data, and sure the vast bulk of it I’m unlikely to ever need to use again, then there’s a small chunk I’m likely to want to hang onto for nostalgia value, and a yet smaller chunk that I’ll be happy to have accessible again.. And 50gb is pretty much nothing these days as far as storage goes.. I can dump the entire lot onto a usb key, or just leave it on the home server.. either way its taking up way less physical space than before, and its now accessible & indexable without needing to find the ‘right’ disk in a huge collection.
With storage increasing in size the way it has, storing all your old data becomes easily doable..
If hoarders did a data version, I’m sure there’d be quite a few ppl ahead of me in the list ;p
That’s a lot of unfounded assumptions to pack in to one sentence.
10G? 5000 disks at 1.44MB = 7.2GB ? unless there 720KB? then its more like 3.6GB
and at 250 disk in 12 hours (so 250 a day). It’s going to take him 20 days just to create the backups. Then 15 minutes to make DL-DVD backup.
Funny how it takes him 12 hours to store 360MB of data. I can download that much info in a couple minutes with todays technology.
p.s. replace eject with high power spring maybe? so it can shoot across the room.
You are thinking of the size of the data on the disk.. I’m storing the raw mfm track timings.. over 50gb worth stored..
I was thinking the same thing. It’s like having a personal storage unit or garage where you store all of your crap that you will never need again, but just don’t want to throw away.
This is an awesome hack, though.
That’s actually kind of funny. He has gobs of floppies that he needs to go through and do whatever with. So, to save space and speed up his own UI time, he automates the process of transferring them to a different media, eg HDD.
Here’s the catch. Once he’s done transferring everything to a HDD. Then the time spent writing a database to track and access the files. He’ll do what I do, leave it on the HDD for the rainy day that never comes.
A few years down the road, that HDD will get full so he’ll either transfer the whole gob to a larger capacity drive or, I do this a lot, just outright replace the old drive and store the old one.
In about twenty years we’ll see a HaD post about a dude who needed to auto-extract HDD images to whatever the media of choice is in the future.
:)
I’m pretty far through writing the app to organise & categorise the disk images, (coverdisk, which mag, sound samples, amiga, st, pc, game, etc) .. about 1 in 5 of the disk images were identifiable via CRC against the Tosec database, of the remaining 4 in 5, 1 in 5 are ‘my’ data, 2 in 5 are likely stuff that is well known where my copy is altered somehow (bootblock changes, timestamp changes, etc).. and the last 1 in 5.. who knows, pc formatted? st? mac?
The data is currently on a 27tb array, so I doubt I’ll miss the space, or need to move it onto a new disk for a while yet.. although yes, I do have an entire shelf full of old hdd’s, I started cloning the data from those to the array years ago..
He could be doing this in the first place because he needs/wants the data NOW…not on some rainy day. Or because he’s tired of using tons of floppies for what a HDD can do with a single unit.
I’m not exactly a fan of floppies. I had exactly one floppy disk left in my apartment(Win 3.1, disc 3), and I cut up the mag media last week to make a visible light stop for my IR camera.
Must be a really hairy, awesome porn collection.
Greaaatttt….. fifty thousand grainy, 8-bit, low-res pics.
The ultimate mood killer??? Think about what all those hotties look like NOW.
I’d like to imagine that good percentage are MILF or GILF today. Then again I’m a broke, crippled, middle age man where positive thing is required.
I used the browser word search function at the blog to look for the word error, but it wasn’t found. I wonder how this handles a disk error of any sort? I like how this build, and video documentation is so complete, complete to the point the video’s music track was created by the builder. Nice work.
No, it will be my next project to auto archive my porn mag collection. Still trying to think of ideas for auto page turn, the problem will be pages that are stuck together.
Watch this space.
ROFL.. course web pages don’t get stuck together.. mebbe we need a dojo object for that ;p
WORST MACHINE GUN EVER! ;) Anyway could this thing be adapted to 5 1/4″ disks, got some Commodore disks somewhere.
Crap; I forgot to use the cancel function after responding to comment, before composing a comment directly to the article.
“I wonder how this handles a disk error of any sort?”
The kyroflux is recording raw mfm flux timings, within that there can only be illegal mfm sequences.. I’m asking it to also decode that to ADF (amiga formatted track data), which it uses to decide if the track really was amiga, if the checksum is good/bad. For bad tracks, or unknown tracks, it head jiggles, and rereads the track multiple times. The stored raw data can be used to replay having the disk present, and reinterpet it as different formats without needing the actual disk again. (eg, where the amiga had written 720k dos format for interop)
Won’t help for physical damage, or head misalignment, or sectors lost to LamerII, but helps a lot with regular disk problems.
Great project but what’s the music? Sounds like a piano arrangement for the Dr.Who theme?
Indeed, I was noodling around on Cm, and found the dr who theme fitted nicely on a blues chord set around there, so hit record & had a play.. figured time travel was appropriate for reading floppies ;p
I remember when I parted with my amigas. That last data transfer over. The harsh keep and kill decisions.
Paid my way through college with it. Those were the days.
btw, the images would only be 8bit if they started on a pc. They would be 12 bit on the amiga.4096 color HAM. At a glorious resolution of 320×400.
Unless you’re talking the Amiga 4000. It had more, but I dont recall what it was. Never bought one.
I had 24bit or whatever screens on my A1200*
* it had a BlizzardPPC and a BVision :p
Video toaster 2.0. So I had 24 bit, just not real time. GVP 030 and 040 accelerator cards. 17MB of high speed ram. Ah the days!
I think my toaster box all told was over $10k just inside it. Never mind all the stuff it hooked up to.
Amazing to think I can do more with about $400 in hardware and software now.
i would improve this by the following:
solanoide actuates a horizontal bar instead of just 1 tiny pole. – aligment fixed , even better if you make a V shape so it falls right in there.
meta data filling – it’s kind of simple –
last data written -> time = (+- ejection)time <- last picture taken.
run jpg's trough some OCR to fill in most common bitts (copyright yada yada) and do the special font title yourself.
now make a double or quattro loader that simultaniously eject the floppys so they can be scanned by flatbed scanner.
The solenoid was borrowed from inside the original duplicator, so I went with what I could make work with the least effort there.. even using the tiny pole, no disk ever ‘escaped’ so it did its job.
A V shape to have the disks straighten up a bit would have been a good idea, you have to keep it wider at the top, as the positioning at eject isnt exact, the disk has already fallen 4 or 5 inches. Instead, I’m doing that post using software, it’s pretty simple to have the images rotated & cropped down to just the disk part.
Very few of the disks have info thats worth OCRing, those that do are usually coverdisks, and you get way more metadata for those just by associating the hash of the ADF via Tosec to online info for the disk in question.
All the data is stored disk by disk to a subdir that uses the timestamp of the capture as its name. That’s enough to keep it unique, and I use ADFInfo to pull the amigados volume name, and various amigados check results where applicable.
The adf for each dir is cross checked against Tosec checksums, and against all other dumps by the autoloader, this at least lets me handle dupes easily, and rule out already known disks.
After all that, I still have to hand process roughly 4/5 of the dirs. I found a nice amigados processing library in Scala, and I’ve integrated that, so from the cataloguing tool, I can browse into the adf, to let me confirm if a disk really does contain what the label said.
Beautiful! If I just had a machine like this! I still have to dig through about 1,000 floppies but I don’t dare to do so manually. How to build your setup?
The biggest problem I had when reading old Amiga (and PC) floppies is that they really do not age well, almost all of mine were unreadable after 20-odd years, giving read errors and screeching sounds.
For newbies, there’s a utility called Transdisk which can image a floppy on a real Amiga (as well as put images back onto a floppy), although this will not work with copy-protected games. There are various methods to transfer the resulting images to a PC, much cheaper than this method!
Magnificent!
Especially loved the DW theme music.
5000 floppies – Yikes!
Awesome! Now if you really did/do bring yourself to dispose of the floppies make sure to back up that hard drive!!!!
SWEET! looking for one of these!