Complete Digitization of Leonardo da Vinci's Codex Atlanticus
144 points
2 days ago
| 10 comments
| openculture.com
| HN
WillAdams
2 days ago
[-]
Interesting UI --- wants a full-screen mode and 2-up view and a way to remove all the chrome/UI....

An earlier example of this sort of thing was Bill Gates' purchase of the Codex Leceister https://en.wikipedia.org/wiki/Codex_Leicester which was then digitized and released on a CD-ROM by Corbis:

https://en.wikipedia.org/wiki/Leonardo_da_Vinci_(video_game)

which was quite engaging, but sadly trapped in the technology of the time --- anyone know of an updated version of it?

reply
mzs
2 days ago
[-]
reply
Isamu
2 days ago
[-]
If you get an opportunity to see them in person, it’s worth it because the fine details are that much more impressive up close. Every photo I’ve seen is not as good. Also the illustration is tinier than you would think.
reply
kragen
2 days ago
[-]
This is beautiful! I am having some difficulty with the UI; is there a torrent? Images like https://codex-atlanticus.ambrosiana.it/assets/500/000R-1.jpg are too low in resolution for good archival; you can't even read the writing.
reply
trvz
2 days ago
[-]
Manipulate the URL for a higher resolution:

  https://codex-atlanticus.ambrosiana.it/assets/2000/000R-1.jpg
You don't need to depend on others to create a torrent, as bestowed upon you was the power of wget!

  wget https://codex-atlanticus.ambrosiana.it/assets/2000/000R-{1..1119}.jpg
  wget https://codex-atlanticus.ambrosiana.it/assets/2000/000V-{1..1119}.jpg
reply
kragen
2 days ago
[-]
Thanks! On my cellphone not even enough of the UI was working for me to discover those URLs. I suspect a certain amount of error recovery is in order for wgetting all 2238 images. 2000 seems to be the maximum resolution available, which is under 100dpi. A few of the images seem to have been uploaded to https://commons.wikimedia.org/wiki/Category:Codex_Atlanticus.

There are a couple of scans of a 43-page Italian edition published by Ulrico Hoepli on the Archive: https://archive.org/details/codex-atlanticus-leonardo-da-vin... https://archive.org/details/codex-atlanticus-leonardo-da-vin... but they seem to be of very poor quality.

I'm done downloading now (with a sleep of 1 second between pages), and I have 1064125470 bytes of JPEG files, a very reasonably torrentable size. I'll see if I can put together a torrent and upload to the Archive and Commons...

reply
WithinReason
2 days ago
[-]
Or in PowerShell on Windows:

  1..1119 | % { iwr "https://codex-atlanticus.ambrosiana.it/assets/2000/000R-$_.jpg" -OutFile "000R-$_.jpg" }
  1..1119 | % { iwr "https://codex-atlanticus.ambrosiana.it/assets/2000/000V-$_.jpg" -OutFile "000V-$_.jpg" }
reply
embedding-shape
2 days ago
[-]
Some people around me swear PowerShell has better user experience than unix shells, but then I keep seeing examples like these. How on earth could people prefer this compared to `wget https://codex-atlanticus.ambrosiana.it/assets/2000/000V-{1.....`?
reply
kragen
2 days ago
[-]
In this case presumably the main difference is not PowerShell vs. bash but iwr vs. wget? Because I think this is roughly equally bad (untested):

    for page in {1..1119}; do
        iwr "https://codex-atlanticus.ambrosiana.it/assets/2000/000R-$page.jpg" -OutFile "000R-$page.jpg"
        iwr "https://codex-atlanticus.ambrosiana.it/assets/2000/000V-$page.jpg" -OutFile "000V-$page.jpg"
    done
Also until recently bash didn't have {42..53} syntax. You had to use `seq`. There was an alternative name for `seq` in Unix Power Tools, `jot`, because it wasn't standard: https://docstore.mik.ua/orelly/unix/upt/ch45_11.htm. This section was by ORA author and sysadmin Linda Mui (https://www.oreilly.com/pub/au/268), but I don't know if she wrote `jot` or just popularized it.
reply
NoMoreNicksLeft
2 days ago
[-]
Any idea on how to best compile it to an ebook? Just stuffing the jpgs into a pdf rarely works well...
reply
foofoo12
2 days ago
[-]
I usually do what rarely doesn't work well for you, but it works decently for me. You get 1 page per image and the image isn't compressed or touched at all.

  apt install img2pdf
  img2pdf *.jpg -o leonardo-da-book.pdf
reply
nunodonato
2 days ago
[-]
wouldnt this mess up the order? I think you are supposed to view it like R1, V2, R2, V2, etc
reply
foofoo12
2 days ago
[-]
Yes, this was just an example. Using wildcard expansion will give you whatever order the your current shell seems fit. Bash does alphabetical order.
reply
kragen
2 days ago
[-]
More like

    echo $(for page in {1..1119}; do for side in R V; do
      echo "000$side-$page.jpg"; done; done)
reply
c0balt
2 days ago
[-]
I haven't that done this in some time, but templating some markdown code for pandoc and creating an ebup might be a viable avenue.
reply
kragen
2 days ago
[-]
Maybe what rarely works well for NoMoreNicksLeft is having a gigabyte of JPEGs in a single HTML chapter inside the epub? In that case you could do something like divide the files into 373 "chapters" of 6 pages each?

One of the fragmentary editions I linked on the Archive uses the .cbr Comic Book Reader format; perhaps that is a better format than .epub for high-resolution scans of every page?

reply
NoMoreNicksLeft
2 days ago
[-]
Oooh... I have even less luck with epub, when the pages are an image-per-page.
reply
ticulatedspline
2 days ago
[-]
Easy way would be to just drop them in a zip and label it .cbz. Most readers handle CBR/CBZ just fine.
reply
kragen
2 days ago
[-]
Oh, is .cbz that simple? Does it use the file order of the zipfile members or some other order? (https://acbf.fandom.com/wiki/ACBF_Editor_-_Creating_Metadata says it uses alphabetical order, which is the wrong order in this case.)

It may be useful to use zip -Z store. JPEG data isn't going to get much benefit from another layer of LZ77.

reply
atoav
2 days ago
[-]
Calibre comes with a ebook-convert command, that one might work
reply
eMPee584
2 days ago
[-]
ocrmypdf (rocks!)
reply
whatever1
2 days ago
[-]
How much talent can fit in a person? This is how much.
reply
nunodonato
2 days ago
[-]
indeed! The biography of Leonardo was an amazing read. Highly recommend it
reply
proee
2 days ago
[-]
Can you recommend the author?
reply
nunodonato
2 days ago
[-]
https://en.wikipedia.org/wiki/Leonardo_da_Vinci_(Isaacson_bo...

The same author who wrote some other famous biographies. I know some people prefer other DaVinci's biographies. I didn't read others to be able to compare, but I really enjoyed this one.

reply
kragen
2 days ago
[-]
Nitpick: "da Vinci" wasn't our homeboy's name. That just means "from Vinci". He was "Leonardo", like many other people, so we added "da Vinci" to clarify which Leonardo we meant, just like you might say, "Jessica from church came by," to clarify that you didn't mean Jessica the ex-girlfriend. Surnames weren't very widely used in Italy then.

It's like "Jesus of Nazareth"; you wouldn't talk about "other OfNazareth's biographies". Ain't grammatical.

reply
cma
2 days ago
[-]
It's fine. John Smith once meant the John who works as a blacksmith etc. Whatever the original meaning we now widely take da Vinci to be the last name if we don't speak Italian.
reply
dragonwriter
1 day ago
[-]
> John Smith once meant the John who works as a blacksmith etc.

Yes, modern surnames largely evolved from descriptive epithets added to distinguish different people of the same given name, but that doesn't retroactively transform the descriptive epithets of commonly applied to people who lived in the past into surnames for those people.

reply
kragen
2 days ago
[-]
I agree that the error is common. Try to make new errors instead of repeating common errors.
reply
card_zero
2 days ago
[-]
Does this also apply to DiCaprio? His name seems to translate as "the deer's Leonardo", or maybe "the goat's Leonardo". Possibly "son of a goat".

Wikipedia says that Leonardo da Vinci was properly Leonardo son of Piero from Vinci son of Antonio son of another Piero son of Guido. I'm not sure that moving to surnames was a mistake, you know.

reply
kragen
2 days ago
[-]
Nope, that's his actual surname. He wasn't born in the 16th century.
reply
card_zero
2 days ago
[-]
But at some point back in time, when an ancestral DiCaprio was first referred to as just "DiCaprio", that was an error, right? He should properly be called Quello Figlio di Caprio, that son of a goat. It's not too late.
reply
kragen
1 day ago
[-]
Probably not, no, and AFAIK Leo is a nice guy who doesn't deserve to be deprecated in that way.
reply
cma
1 day ago
[-]
Descriptive linguistics, how stuff is actually used, is a lot more useful in practical real-life communication vs prescriptive.

Da Vinci is a shorthand that everyone will understand vs just calling him Leonardo. Writing Leonardo da Vinci will be more explicit but will come off much more formal and stilted.

reply
kragen
1 day ago
[-]
Nobody who knew Leonardo called him "da Vinci", any more than you would call Jessica "from church" ("Hey, is From Church coming over tonight?") or Venezuelans would call Hugo Chávez Frias "Mr. Frias". "Descriptive linguistics" is not a magic trump password that makes all erroneous utterances correct. If you haven't studied 16th-century Italian, you're going to make errors when you name 16th-century Italians.
reply
cma
1 day ago
[-]
> any more than you would call Jessica "from church" ("Hey, is From Church coming over tonight?")

That's not in common use, so wouldn't fall under descriptive linguistics. No English speaker was puzzled at whether Dan Brown's The Da Vinci Code was about someone else from Vinci's code. It's an established convention at this point.

reply
kragen
1 day ago
[-]
The established convention is to use the given name "Leonardo", just as with Raphael, Michelangelo, and little Donato ("Donatello"). Dan Brown is also not an authority on the descriptive linguistics of 16th-century Italian.
reply
cma
1 day ago
[-]
The Dan Brown mention was about descriptive, showing it's popularly understood using a popular book, not prescriptive.
reply
felixbraun
2 days ago
[-]
Excellent work — reminds me of similar projects built with the same tech stack:

– Coins: A journey through the Münzkabinett Berlin collection (one of the largest in the world). https://uclab.fh-potsdam.de/coins/

– Theodor Fontane Marginalia: A visualization of Fontane’s marginalia and notes in his personal library. https://uclab.fh-potsdam.de/ff/

reply
vim-guru
2 days ago
[-]
Why are some of the pages upside down?
reply
embedding-shape
2 days ago
[-]
It's a bit bananas, but probably just because he could. He also wrote his personal notes in "mirror writing":

> The notes on Leonardo da Vinci's famous Vitruvian Man image are in mirror writing. Leonardo da Vinci wrote most of his personal notes in mirror writing, only using standard writing if he intended his texts to be read by others

https://en.wikipedia.org/wiki/Mirror_writing

reply
foofoo12
2 days ago
[-]
Da Vinci was showing off.
reply
b34k3r
2 days ago
[-]
just rotate your monitor
reply
nunodonato
2 days ago
[-]
amazing! The categorization is nice, but I would love to see some sort of "tag cloud" that would allow use to view more specific content. How long until someone creates a tool to RAG the hell out of this? :)
reply
MangoToupe
2 days ago
[-]
> We use it to express mild surprise that one person could use both their left and right hemispheres equally well.

When did this myth become so perpetuated? It's infuriating. I blame university administration. I can't think of any other reason to so firmly distinguish different areas of thought.

reply
NaomiLehman
2 days ago
[-]
I'm training a model based on this /s
reply