2019 | Internet Archive Blogs (2024)

Posted on October 21, 2019 by John Hickey

Minot State University in North Dakota had reason to love the idea of having the thousands of elderly recordings its library had in storage. Who doesn’t love the idea of having access to that kind of American musical history?

Minot State’s History 78s Project wanted to get its collection of approximately 15,000 78s out to a wider audience and is using Internet Archive to get the biggest bang.

One problem. The school community didn’t have any access to those discs. Playing 78-rpm records in the day of iPods, Pandora and Spotify is no easy feat.

The collection of 78-rpm discs covering classical to early jazz, blues and country music was historically significant. But without digital access, the music wasn’t being taken in by the student body, faculty, alumni or the wider world. And so Minot State is donating approximately 15,000 recordings to the Internet Archive so they can be digitized and preserved.

“We’re tremendously excited that the Internet Archive will be able to preserve this rare slice of America’s early recording history,” said Minot State history professor Daniel Ringrose. “Personally, I’ve always thought history needs a soundtrack, and this digitation project will let many people experience that past by listening to it.”

The Internet Archive is in the midst of digitizing the entire lot, and Minot State’s MSU History 78s Project will have as its reward the complete collection digitized.

“We are preserving the state’s property,” Ringrose said. “This is a chance to partner with Internet Archive make accessible America’s sound history. There’s jazz in there, patriotic music, dance music, waltzes, western Americana and even some early rock and roll 78s. The earliest discs go back before World War I, to 1912 and 2013. There are a lot of regional labels and regional artists that would be hard to find.”

It took Minot State’s History 78s Project three weeks of work from volunteers to get the collection ready to ship to Internet Archive.

Thanks to a group of 19 volunteers that spent three weeks of summer mornings sorting the individual discs into sleeves and boxing them up for shipping the discs have been delivered to the Internet Archive.

“Many of the recordings never made it off the 78s they were originally on,” said Brewster Kahle, founder of the Internet Archive. “So, having the collection at the Internet Archive gives a breadth and depth to music people have not heard for decades.

“We are hoping there are many records from the middle of the country from the early 20th century representing the different immigrant communities and emerging blues, jazz and country genres.”

The Internet Archive, which currently has about 150,000 recordings digitized and available, has the capacity to digitize about 4,000 discs per month, making it likely that the entire Minot State collection will be available to the public by sometime in 2020.

“It’s exciting to have some organization make it possible to have people study this and to make it more accessible,” Ringrose said. “Some of this stuff is still in original sleeves and may never have been played. Other stuff was so old and so fragile, and we didn’t have any equipment to play it on. This can be a learning tool.

“We recruited volunteers aged from 12 to 72 to box this stuff up for shipping. And the 12-year-old is asking how much music is on one of these discs. There is no point of reference for a disc that may only have three minutes of music on it. The whole collection might fit on that iPod that Steve Jobs held up 10 years ago, and we needed a whole semitruck to ship it.”

As the newly acquired discs from the MSU History 78s Project get digitized, they will become part of the The Great 78 Project at Internet Archive.

And there is no shortage of those waiting for the new additions. There are almost 5,000 followers of the Internet Archive’s twitter robot feed highlighting a newly digitized record every hour (https://twitter.com/great78project).

The Minot State donation may be at the leading edge of a new wave of library activity. The North Dakota school wanted to find a way to get the collection out to a wider audience and target the space that the recordings were taking up to other purposes. The first thought was to offer all that music to another school, UC Santa Barbara, with a large collection of 78s. UCSB couldn’t make it happen, but David Seubert from the school introduced Ringrose to Kahle and Internet Archive.

“We are thankful that with our current funding, we can pack, ship, process and preserve the whole collection, both physically and digitally,” Kahle said. Minot State will get back a digital copy of their entire collection which Ringrose said he believes will prove to be more accessed and useful than the 78 collection had been in recent decades.

Kahle said that the addition of the Minot State collection together with a similar one from the Boston Public Library are the first steps in opening the world of early recorded music to the digital age. They are among the first to decide to reformat their collections as opposed to other options such as disposing of them.

“By bringing antique recordings to the Internet, we hope to get closer to universal access to all knowledge,” Kahle said, adding that the Internet Archive’s goal is for 400,000 78-rpm recordings to be put online, of which about 150,000 have already been digitized by Internet Archive. “We are actively looking for records we do not already have.

“We would like these recordings to be linked to from Wikipedia and discographies, and also from reference sites such as discogs.com and musicbrainz.org. We even have an interface to the collection through Amazon’s Alexa. If you say `Alexa, ask Internet Archive to play tango 78s,’ it will play for hours. Similarly, with Google Home, if you say `Hey Google, ask Internet Archive to play hillbilly 78s.’

“Whatever it takes to get this music into the hands and ears of digital listeners.”

Posted on October 20, 2019 by Carl Malamud

In India, many speeches begin or end with the phrase “Jai Hind.” Jai means “long live” and “Hind” of course is the great Republic of India, the largest democracy in the world. Jawaharlal Nehru popularized this phrase, and it became the battle cry of the fight for liberation. “Gyan” means knowledge, and I have taken to ending my speeches in India with “Jai Hind! Jai Gyan!”

In this blog post, I’d like to tell you a little bit about my work in India, talk about some of the amazing resources having to do with India on the Internet Archive, and recognize some of my colleagues who have shown me so much kindness and given me so much inspiration.

2019 | Internet Archive Blogs (4)

India was not a complete stranger to me. It was one of the stops in my Internet travelogue Exploring the Internet, I finished one of my first books on a houseboat in Srinigar, and India played an important role in the Internet World’s Fair, and I was honored to have His Holiness the Dalai Lama write the foreword to my book about the fair and allowed me to present him a copy in Dharamshala.

For the last years however, I’ve been spending a great deal of my time in this amazing country. I wrote about my passage to India last year in the book Code Swaraj, which is of course available for free, no rights reserved, and has been translated into Hindi, Urdu, Bangla, Punjabi, Gujarati, Kannada, Martha, Tamil, and Telugu. My fascination with and commitment to India comes from many sources, but I was particularly inspired by the movement for liberation and the lessons we can learn from Mahatma Gandhi for how one can confront authority and change the world.

My work in India has been made possible by a generous grant from the Arcadia, to whom I am immensely grateful for allowing me to pursue this work. Arcadia has been instrumental in promoting open access throughout the world, including support for the Internet Archive and many other groups.

2019 | Internet Archive Blogs (5)

The Internet Archive now hosts one of the very largest collections of materials by and about India. Let me tell you about a few of these:

  • The Public Library of India is a collection of books mirrored from the Internet in over 100 languages. Many of those books we archived are no longer available in their original locations, so we are thrilled that the Internet Archive became a home for these valuable materials. Some of the scans are old, some of the metadata and quality control is not so good, but the materials are unique and our 425,121 texts have received over 62 million views.
  • Because the metadata on the Public Library of India had been entered in Roman characters, that made the collection much less useful for non-Roman scripts, a team of Wikipedians led by my friend Arjuna Rao Chavala painstakingly reentered the titles and creators for 17,655 books in Telugu into the original script, making the books findable for those that speak those languages. That effort is now being replicated for other languages, such as Kannada and Tamil.
  • One of my personal passions has been the Hind Swaraj collection, which is devoted to the fight for Indian independence. The collection features 595 works about Gandhi Ji including all 100 volumes of his Complete Works, as well as the complete works of Nehru, Ambedkar, and substantial collections of texts and audio by figures such as Rabindranath Tagore, Sarvepalli Radhakrishnan, Subhas Chandra Bose, and Sardar Patel.
  • Public Resource has been honored to work closely with the Indian Academy of Sciences, with whom we have a formal Memorandum of Cooperation. As part of that effort, we have digitized all the Indian Academy’s books, and maintain an extensive collection of science resources of India. It has a great pleasure to work with and become friends with my colleagues at the Academy, including the distinguished Professor Amitabh Joshi who has spearheaded this effort and Professor Partha Majumder, the President. We have in place similar Memoranda of Cooperation and have hosted collections for the JC Bose Trust and the National Center for Biological Sciences.

One of the most gratifying things about working in India is the public spirit, technical skills, and enthusiasm of volunteers all over the country. We have banded together and call ourselves the Servants of Knowledge, a hat-tip to Gokhale’s Servants of India society. The Indian Academy allowed kindly allowed us to place a Table Top Scribe in their Bengaluru headquarters, a unit which was donated by the Kahle Austin foundation. It has been a true delight to work with and learn about the Internet Archive digitization framework which is extensive and incredibly powerful.

2019 | Internet Archive Blogs (6)

The Servants of Knowledge collection and the scanning effort in Bengaluru is managed by my friend Omshivaprakash, a long-time wikipedian and a passionate advocate for the Kannada language and heritage. Likewise, Shiju Alex has been toiling for years to digitize key works in Malayalam. In Mangaluru, Prashanth Shenoy has led the effort for Konkani texts, and in Chennai the indomitable T. Shrinivasan has long worked to make more Tamil resources available on the Internet.

2019 | Internet Archive Blogs (7)
2019 | Internet Archive Blogs (8)

Since, 2011, Public Resource has worked with Dr. Sushant Sinha, the founder of Indian Kanoon, the amazing free site that provides access to all case law and other legal materials in India. Indian Kanoon was recently honored with the prestigious Agami Prize for service to the citizens of India. Sushant and I have been working on a project for a year that we believe is transformational, pulling in the Official Gazettes of India from the central government and 19 states, an archive that is updated daily and has over 455 documents. (The Official Gazettes are the newspapers of government, akin to the Federal Register in the United States.)

2019 | Internet Archive Blogs (9)

Particularly impressive has been Sushant’s effort to extend the Internet Archive by doing OCR in Indian languages. He has written code that pulls a document off the Internet Archive and bounces it off Google Vision for the OCR, then recreates the files that the Archive would expect to see if it had done the OCR in the Abby software it uses. The code is now working, and he’s been applying it to mixed-language Gazettes in Hindi and English and to the Karnataka Gazette in the Kannada lanaguge. The code is totally open source, we are beginning to apply it to books, and we are hoping to supplement Google Vision with tesseract and other modules.

Public Resource has two other major efforts in India, both of which we believe have the potential to be transformational not only in India but in the rest of the world.

  • In Delhi, we have a formal memorandum of research cooperation with Dr. Andrew Lynn of Jawaharlal Nehru University where we have created the JNU Data Depot, an effort to advance text and data mining on the scientific corpus by researchers. The system is carefully modeled after the Hathi Trust effort in the U.S. and makes carefully secured access to the corpus available to non-commercial university researchers who are able to perform non-consumptive text and data mining. This project was recently featured in Nature, the international journal of science. In addition to the JNU facility, Public Resource has installed a mirror at IIT Delhi, under the direction of Dr. Sanjiva Prasad. We have a distinguished board of advisors from universities throughout India and have received legal advice and counsel from some of the most distinguished intellectual property experts in the country, including Professor Arul George Scaria, Professor N. S. Gopalakrishnan, Professor Feroz Ali, and Professor Lawrence Liang. I have also been grateful for the personal insights and friendship provided to me by Dr. Zakir Thomas, a senior civil servant and the former Registrar of Copyrights for India.
  • One of our initial programs in India has been to make available all Indian Standards, the public safety codes of India. We have made 18,471 such standards available in our Public Safety Codes Collection and the documents have been invaluable for millions of Indian students, government officials, and others who need to consult these valuable government-issued rules and regulations. We have filed a public interest litigation writ petition before the Hon’ble High Court of Delhi after the government objected to our efforts. My co-petitioners are Dr. Sinha and my friend Srinivas Kodali. We are represented before the Hon’ble High Court by senior advocates Jawahar Raja and Salman Khurshid and the law firm of Nish*th Desai and Associates.

You can read more about my efforts in India on the Public Resource Docket where I keep a listing of speeches, press, and other public information.

I close on a sad note. India lost a remarkable person this year and I lost a dear friend. Shamand Basheer passed away at the young age of 43 after a very long illness. In his short time on earth he touched so many lives, mine included.

2019 | Internet Archive Blogs (10)

Shamnad had the finest legal mind of anybody I have ever met, and I have had the privilege of working with many of the best. Shamnad, though, was on a wholly different level. He was considered to be the leading intellectual property expert in India, but he also pursued justice in many other areas of the law. He filed a petition that challenged discrimination in law school admissions, he intervened in the landmark Novartis case, he played an instrumental role in the Delhi University Photocopy case.

But he did so much more. His greatest accomplishment is IDIA, which he created and spent his greatest efforts with. IDIA’s mission is increasing diversity by increasing access to legal education. They find young students with great potential but living in impoverished circ*mstances, get them ready to take the exam to get into law school, then stick with them to get them through the program. These young lawyers then go back to their communities to provide justice. It is an immensely inspirational program and you can do nothing better to honor Shamnad’s memory than donate to IDIA.

I read a lot about Gandhi, I speak about him frequently and learned much from his work. Gandhi is an important part of my life, but with Shamnad it was different. Shamand, more than anybody I ever met, lived his life like Gandhi. He was a public worker, devoted to justice and equality. He knew a vast number of people, and every person he met, he touched their lives deeply.

He had been dreadfully sick for so many years, but he did not let that stop him. I admonished him once when he invited me to speak at the IDIA annual event and he had just flown in the night before from Iran, where he was advising people on intellectual property. You should take it easy, I told him. He replied that he wasn’t going to let his illness rule his life, and he never stopped his public work for one minute.

The night before he passed away, we were exchanging messages on WhatsApp. He had been especially sick of late and had gone on a pilgrimage to Bababundangiri. His last words to me, which I wish to share were:

I am in a very special place right now. Even as my body is battered my spirit is strengthening. In Baba Budangiri a site of amazing syncretic spirituality. Where I have found much peace and meaning. And a place that has helped me transcend the body. And in this special place, I am offering prayers for you. And sending you lots of good energy. To continue the good fight. Lots of love. Shams

Wherever Shamnad is now, his star will shine bright for the ages. He will forever be missed and forever remembered. May the gods bless you Shamnad, thank you for all you did for me and for so many others.

Posted on October 18, 2019 by John Hickey

Extinction isn’t just a biological issue. In the 21st century, it’s a technical, even digital one, too.

The average web page might last three months before it’s altered or deleted forever. You never know when access to the information on these web pages is going to be needed. It might be three months from now; it might be three decades. That’s how the Wayback Machine serves—making history by saving history. Now, the Wayback Machine is fighting digital extinction in brand new ways.

As the Internet Archive prepares for its anniversary celebration on Oct. 23, our Wayback Team is unveiling some new features to make what some call “the memory of the web” even more detailed and responsive.

Try out some of our new Wayback Machine Features:

  • Changes: a new service enabling users to select two different versions of a given URL and compare them side by side. Differences in the text of the content are highlighted in yellow and blue.
2019 | Internet Archive Blogs (12)
2019 | Internet Archive Blogs (13)

Just click the “Changes” link at the top of the “Calendar View” page to find an index of archives of the selected URL with a high-level indicator of the degree of change between the available archives. When no content has changed, the page appears in the same color. You can then select any two archived versions of the page so they can be rendered side-by-side with the changes between them highlighted in blue and yellow. Best of all you can then share this “Changes” URL with others (e.g. via Twitter or embedded in a news story) so others can easily see the changes as well.

  • Save Page Now: an updated version of perhaps the most popular feature of the Wayback Machine. Of particular import is the new ability to archive all the embedded links and outlinks (connections to external web sites) with just one click.
2019 | Internet Archive Blogs (14)

Also new is the ability for users to save web archives in a public directory of favorite items. It’s essentially a personal but public bookmarking system of pages that others can follow. Imagine how important this might be for future researchers, family members or fans interested in the web pages you chose to personally save for all time.

  • Collections: A new way to learn about why a given URL has been archived into the Wayback Machine. Start by clicking the “Collections” link at the top of any “Calendar View” page. You will then be shown a list of all the collections that this URL is included in, plus you can select individual playback URLs from any of those Collections. Click on the Collection name to learn more about its provenance.And if it was created as part of the Internet Archive’s Archive-It service, you can execute full-text searches on archived web pages that are part of that collection.
2019 | Internet Archive Blogs (15)
  • Show All Captures: The Wayback Machine archives some URLs many times a day.In some cases hundreds, or even thousands of times a day. While all of those captures have been available for playback, the calendar view would only show a sampling of those captures.The new Show All Captures feature now presents a list of each and every capture available per day, even for captures that are made seconds apart.
2019 | Internet Archive Blogs (16)

Who will be using these new features? Earlier this month, Mark Graham, the Wayback Machine’s director, got a request from a TV journalist for help—not just for something Trumpian or Brexit-ish. Instead, the just-married journalist saw that her wedding day web page was about to expire and wanted to be sure it would be preserved. Using the new and improved Save Page Now, she was able to preserve the page (including all outlinks) with one click.

The Environmental Data and Government Initiative (EDGI) partners with the Wayback Machine in its work monitoring government websites with particular emphasis on environmental issues. Our new Changes feature will help them track and publicize how government agencies are deleting and altering information about climate change and environmental protection issues, by comparing and publishing web pages side-by-side.

Graham underscores that the Wayback Machine, which has many scholarly, historical and journalistic uses, “is relevant to how you live in the United States today. Wayback Machine captures are even admissible in many courts.”

“It can be used for holding people and governments accountable,” he said. “At the same time, it can be used for other things, like a bride’s request to preserve a wedding page.”

Fighting digital extinction, the Wayback Machine way.

NOTE: On October 23rd, come by the Wayback Machine Demo station at our World Night Market event to meet the team who built these new features. You can purchase your tickets here.

Posted on October 17, 2019 by Jenica Jessen

2019 | Internet Archive Blogs (18)

The Internet Archive’s central mission is establishing “Universal Access to All Knowledge,” and we want to make sure that our library of millions of books, journals, audio files, and video recordings is available to anyone. Since lack of an internet connection is a major obstacle to that goal, we created the Offline Archive project—that works to make online collections available regardless of internet availability.

For many of our readers, the internet seems omnipresent—like electricity and running water, it’s available everywhere from our homes and offices to trains and planes. But for more than half of the world’s population, that access is far from guaranteed. In many developing countries and rural areas, the infrastructure that enables internet access is unreliable, slow, or nonexistent, while natural disasters and conflicts may exacerbate the problem. Additionally, internet access can be too expensive for many people, and some governments limit internet access or censor the content for political reasons. All of these factors can combine to make internet access inconsistent, low-quality, or altogether unavailable for billions of people, which in turn leads to poor educational outcomes and intergenerational poverty. Compounding the challenge, the internet in wealthier countries is growing rapidly, and high-bandwidth videos and graphics are making it harder than ever for people on low-quality networks to participate in the modern web.

As part of a solution to this problem, we have built an offline server that transfers Internet Archive collections to a local server, caches content while browsing, and delivers the Internet Archive UI offline in the browser. The system moves content between servers by “sneakernet”—on disks, USB sticks, and SD cards. This approach should improve access for anything from a Raspberry Pi to an institutional server holding terabytes of data. Right now, we’re working to make it available in a variety of different languages, so that anybody can utilize it—not just English speakers.

2019 | Internet Archive Blogs (19)

Best of all, the Offline Archive project is open source, so that people around the world can collaborate to make it better. We are currently integrating the Archive’s APIs with those of our partners, to make it easier for them to incorporate Internet Archive content. Together with our collaborators, we can bring the Internet Archive anywhere—ensuring that people everywhere can enjoy our digital library.

If you would like to lend a hand, there are lots of ways to collaborate:

  • Software developers can help us add features, platforms, and internationalization
  • Platform developers can talk to us about integrating the Internet Archive’s content or server
  • Content owners and aggregators can help make more content available, especially educational content and material in other languages.
  • Community networks and internet access practitioners can help by becoming early adopters

See archive.org/about/offline-archive for more information, or contact mitra@archive.org to collaborate or contribute to this project.

If you would like to see the Offline Archive in action and meet its builder, Mitra Ardron, then come to the Internet Archive World Night Market on October 23rd and look for the Offline Archive demo table!

Posted on October 17, 2019 by John Hickey

The recently reconstructed music player has more, much more, to offer in making music accessible.

This is a time of transition, musically speaking, at the Internet Archive..

Our online digital library is best known for its immense archive of web pages and websites in the Wayback Machines. Less well known are the million-plus recordings the site has stored digitally and made available to the general public, mostly from 78s, albums and CDs.

Highlighting the growing importance of music on archive.org is the debut this month of our new music player. While you can listen to only a sample of most modern songs, the new player now embeds Spotify and YouTube versions of the full song, so listeners are now able to click right from archive.org to those services and listen to the full track. Examples:https://archive.org/details/cd_ultimate-santana_santana-alex-band-of-the-calling-baby-bash andhttps://archive.org/details/cd_the-big-picture_big-l-2pac-a.g.-big-daddy-kane-fat-joe-gur

2019 | Internet Archive Blogs (21)

We’ve digitized at high resolution the album liner notes, including full CD booklets and the paper labels on the discs themselves. And at the bottom of each page are lists of related music tracks – covers, other versions of the same song done by the same artist and compilations where that song has been used.

2019 | Internet Archive Blogs (22)

“It’s exploratory; it’s not exact,” said Internet Archive’s Brenton Cheng, who is at the head of the product team engineering the new music player. “The system uses each song’s acoustic ‘thumbprint’ to match it with songs in other services. The goal here is to start engaging with the music.”

“With our related music tracks listed down below, you are going to be exploring and discovering items, covers and versions that you didn’t know existed before. I think now we’re doing a better job of presenting the content that we have, and then helping people discover more.”

As streaming services gain popularity, the rich fountain of information found on album covers and CD liner notes is in danger of being lost. The Internet Archive seeks to fill that void by preserving the entire package that makes for a deeper musical experience. Now exploring those covers is right there in the music player itself.

“I think our presentation experience has until now not been as much of a focus as our gathering of materials from different sources,” Cheng said. “So now we are really trying to take time and check with our users, finding out who’s using the site and what they need. And we’re trying to present better experiences for exploring, consuming and searching for content.”

Posted on October 13, 2019 by Jason Scott

2019 | Internet Archive Blogs (23)

Another few thousand DOS Games are playable at the Internet Archive! Since our initial announcement in 2015, we’ve added occasional new games here and there to the collection, but this will be our biggest update yet, ranging from tiny recent independent productions to long-forgotten big-name releases from decades ago.

To browse the latest collection, hit this link and look around.

The usual caveats apply: Sometimes the emulations are slower than they should be, especially on older machines. Not all games are enjoyable to play. And of course, we are linking manuals where we can but not every game has a manual.

If you’ve been enjoying our “emulation in the browser” system over the years, then this is more of that. If you’re new to it or want to hear more about all this, keep reading.

A Recognition of Hard Work, and A Breathtaking View

2019 | Internet Archive Blogs (25)

The update of these MS-DOS games comes from a project called eXoDOS, which has expanded over the years in the realm of collecting DOS games for easy playability on modern systems to tracking down and capturing, as best as can be done, the full context of DOS games – from the earliest simple games in the first couple years of the IBM PC to recently created independent productions that still work in the MS-DOS environment.

What makes the collection more than just a pile of old, now-playable games, is how it has to take head-on the problems of software preservation and history. Having an old executable and a scanned copy of the manual represents only the first few steps. DOS has remained consistent in some ways over the last (nearly) 40 years, but a lot has changed under the hood and programs were sometimes only written to work on very specific hardware and a very specific setup. They were released, sold some amount of copies, and then disappeared off the shelves, if not everyone’s memories.

2019 | Internet Archive Blogs (26)

It is all these extra steps, under the hood, of acquisition and configuration, that represents the hardest work by the eXoDOS project, and I recognize that long-time and Herculean effort. As a result, the eXoDOS project has over 7,000 titles they’ve made work dependably and consistently.

Separately from the eXoDOS project, I’ve been putting a percentage of these games into the Emularity system on the Internet Archive for research, entertainment and quick online access to the programs. The issues that are introduced by this are mine and mine alone, and eXoDOS is not able to help with them. You can always mail me at jscott@archive.org with questions or technical concerns.

This should be all that needs to be said, but since the Archive is doing things a little strangely, there’s a lot to keep in mind before you really dive in (or to realize, when you come back with questions).

That Hilarious Problem With CD-ROMs

Putting these games into the Internet Archive has, over time, brought into sharp focus particular issues with browser-based emulation. For example, keyboard collision, where the input needs of the emulator are taken over by the browser itself, and the problems of a program needing a lot more horsepower to run in a browser emulator than a user’s system can handle.

2019 | Internet Archive Blogs (27)

Some of these have solutions that aren’t always great (Buy faster hardware!) and in some cases the problem is currently terminal (these programs have been taken offline for a future date). But the most obvious and pressing is that games based off CD-ROMs take a significant, huge amount of time to load.

CD-ROMs were a boon to the early-to-late 1990s, allowing games to have audio and video like never before. Depending on the tricks used, you got full-motion video (FMV), the playing of CD audio tracks for background music, and levels and variation of content for the games far beyond what floppy disks could ever hope.

But it was also a very large amount of data (up to 700 megabytes per CD) and it’s one thing to have the data sitting on a plastic disc in a local machine, and yet another to have a network connection pull the entire contents of the CD-ROM into memory and hold it there as a virtual file resources. This is going to be an enormous lean on the vast majority of Internet users out there – downloading multi-hundred-megabyte files into memory and then keeping them there, and then losing it all when the browser window closes. Network speeds will improve over time, but this is probably the biggest show-stopper of them all for many folks.

If you find yourself loading up one of these games and facing down a hundred-megabyte download, consider one of the smaller games instead, unless it’s a title you really, really want to try out. Maybe in a few years we’ll look back at cable-modem speeds and laugh at the crawling, but for now, they’re pretty significant.

Some Jewels in the Mix

Luckily, there are some smaller-sized games in this new update that will load relatively quickly and are really enjoyable to look at and to play. Here’s some of my recommendations:

2019 | Internet Archive Blogs (28)

First, a game special to me: the IBM DOS version of Adventure, calling itself “Microsoft Adventure”. It’s actually a small rebranding of the original start of the text adventure world, “Colossal Cave” or ADVENT, by Don Woods and Will Crowther. Remixed to be sold by IBM and Microsoft, this is how I first got into these, and it boots up instantly, providing hours of fun if you’ve never tried it before.

2019 | Internet Archive Blogs (29)

Mr. Blobby, a 1994 DOS Platform game, has all the hallmarks of the genre – bonkers physics, bright and lovely graphics, and joyful music. Be sure to redefine the keys before you try to play it, because besides running and jumping, you can spin and take things. The game does not get less weird as you go along.

2019 | Internet Archive Blogs (30)

Super Munchers: The Challenge Continues is a 1991 remix of the original educational game that sent your “muncher” gathering up words representing a given topic or idea. The speed of the game, along with the learning aspect, make this one of the more zesty “edutainment” titles available from the time.

2019 | Internet Archive Blogs (31)

Street Rod is a wonderfully compact 1989 racing game where it’s the 1960s and you’re going to buy your first hot-rod, tune it up, and race it for money to buy better and better rides. It’s a mouse-driven interface and loaded with all sorts of tricks to make the game fit into a “mere” 600 kilobytes compressed. Initially simple and then well worth the effort!

2019 | Internet Archive Blogs (32)

Digger from 1983 is a Dig-Dug-Clone-but-Not that came out right as IBM PCs were starting to take off, and it’s a lovely little game, steering around a mining machine while avoiding enemies and picking up diamonds. The most unintuitive thing is you need to fire using the “F1” key, so hopefully your keyboard has one.

2019 | Internet Archive Blogs (33)

I’m also going to suggest Floppy Frenzy from Windmill Software because it’s so much closer to the beginning of the IBM PC’s reign and you can see the difference in what the authors were comfortable with – the graphics are simpler, the game movement a little more rough, and the theme is geekiness incarnate: You’re a floppy disk avoiding magnets to leave traps for them, so you can gather the magnets up before the time runs out. If you don’t make it, an angel comes down and brings you to Floppy Disk Heaven. Again, F1 is the unusual key to leave traps.

There’s many more and I suggest people browse around and try things out, really soak in that MS-DOS joy. (And feel free to leave comments with suggestions.)

Thanks so much for coming along on this emulation journey!

  • Jason Scott, Internet Archive Software Curator
2019 | Internet Archive Blogs (34)

Posted on October 9, 2019 by Faye Lessler

2019 | Internet Archive Blogs (35)

What is lost when globalization dictates modern culture? In Bali, it’s centuries of literature. The Balinese language is still commonly spoken, but the ability to read and write literary works in the Balinese script has largely been lost. Since much of Bali’s culture and history is told in written manuscripts called lontars, the Internet Archive and the linguists at PanLex are teaming up with a group of local Balinese supporters to build new technologies and tools to keep their script and literary culture alive.

Culture is made up of a million little pieces of history, ritual, and everyday life, and that’s exactly what’s written down on Bali’s lontars. These palm leaf manuscripts date back hundreds of years; their subjects include advice on how to build a temple, how to make traditional medicine, and even how to choose the best co*ck to bet on in a co*ckfight, based on the date in the Balinese calendar.

Unfortunately, these ancient teachings—which were created by etching script into dried palm leaves and blackening the words with soot—were in danger of being lost forever due to humidity and time. And although they contain vital pieces of Bali’s rich cultural heritage, the lontars are unreadable for most Balinese who conduct their modern lives more and more in Indonesian.

2019 | Internet Archive Blogs (36)

So in 2011, the Internet Archive launched a project with the Culture Office of Bali to photograph and upload to archive.org some 3,000 lontar manuscripts made up of 130,000 palm leaves—making up “90% of Bali’s literature,” according to Bali’s Minister of Culture. The Internet Archive has preserved these texts in the Balinese Digital Library collection, but they realized that simply digitizing the lontars was not enough, as the resulting images were not easy to share or understand.


This year, the Internet Archive began working with PanLex, an organization dedicated to keeping the world’s languages alive, to engineer methods to transcribe some 3,000 palm leaves online. The team discovered that keyboards do not easily support Balinese script and that there was no complete font for the language, so PanLex worked with a font designer to create new fonts and created a new Keyman keyboard that enables users to type Unicode Balinese script on standard keyboards. These new tools empower more people to participate in transcription and makes it easier to use another tool that auto-generates a Romanized version of Balinese. Transforming the Balinese lontars from lifeless PDFs to machine-readable text means the rich cultural information contained within the lontars will be easier to read, format, and share.

2019 | Internet Archive Blogs (37)

Now thanks to more than 15 local Balinese contributors who are transcribing the lontars online, the digitized texts are published to PalmLeaf.org, where they are available for all. This community-curated Wiki encourages participation by anyone who wishes to contribute to transcribing and translating the lontars. By reading and transcribing the lontars, community partners have an opportunity to absorb the knowledge they contain and share it throughout the world. Through this work, young Balinese are finding more ownership of and connection to their cultural heritage.

“Part of what we’re hoping to change with PalmLeaf.org is to enable the community to take charge of the project and decide how it develops in the future,” says David Kamholz, Project Director at PanLex. “Our goal is to support their hopes of keeping traditional lontar techniques alive and exciting more people to read and be involved in Balinese literature (lontar).”

2019 | Internet Archive Blogs (38)

In a time when so many voices around the world go unheard online due to the language they speak, community involvement in this project is imperative. By transcribing the lontars in their original script, the project team places Balinese front and center, helping to normalize the use of Balinese online. Team members in Bali say that preserving these important texts and encouraging the use of their original language supports the social, cultural, and economic well-being for the people of Bali.

“This work is very helpful to us in Bali. Not everyone has the ability yet to read lontar. This opens access for more of us to learn about and study our literature.”

–Carma Citrawati, Balinese Transcriber
2019 | Internet Archive Blogs (39)

ABOUT THE AUTHOR: Faye Lessler is a California-born, Brooklyn-based freelance writer and founder of lifestyle blog, Sustaining Life. She is an expert in mission-driven communications and enjoys writing while sipping black tea in a beam of sunshine.

Posted on October 8, 2019 by Lila Bailey

2019 | Internet Archive Blogs (40)

In southern Maryland, St. Mary’s County is 54 miles long and there are only three libraries.

“We have people living at one end who might be 25 miles away from a branch,” said Michael Blackwell, Director of the St. Mary’s County Library that operates in the small communities of Leonardtown, Charlotte Hall and Lexington Park.

2019 | Internet Archive Blogs (41)

Yet, many of its rural and suburban residents do have cell phones and tablets. “People in this area are hungry for digital content. In surveys, they say there is not enough. Library digital use is growing, unlike library print use, which is very flat,” said Blackwell. “How to keep up with demand is a real challenge for us.”

That’s why Blackwell sees great promise in expanding the county’s digital offerings through Controlled Digital Lending, the digital equivalent of traditional library lending. CDL opens up access to rural patrons who may not otherwise be able to use the library because of transportation or other barriers. There are children whose parents work three jobs who need books for homework. There are shift workers who work during library hours. There are those for whom a physical trip to the library is simply not possible.

“I’m interested in CDL because a library the size of mine doesn’t have a lot of money,” Blackwell said. “By simply changing the format, we are getting the most out of the books we’ve already paid for. We are not trying to pick John Grisham’s pocket.”

Blackwell also notes that there are many works – including Pulitzer Prize winning books –
that publishers do not make available to libraries in digital form. This is an issue that is beyond rural access, it’s about no one having access at all to books that publishers choose not to provide digitally. For example, James Michener’s “Tales of the South Pacific,” is not available to libraries in e-book form through the major vendors. It is of interest as a story but also for revealing attitudes about human relations at the time of World War II. If a library wanted to circulate in e-book form, CDL would be the only option. It is, of course, the source for the musical, “South Pacific.”

Now, the St. Mary’s library has about 300,000 titles, including about 30,000 digital holdings. The library’s budget is about $3.8 million a year and the small funding increases usually go to salaries and health insurance, not leaving much for new acquisitions.

With high interest in digital content, CDL is being embraced in rural Maryland.

“We are working with the Internet Archive on a pilot to launch CDL titles through the Library Simplified, or ‘SimplyE,’ app,” added Blackwell. “A state grant is allowing us to deploy the app. Our patrons will be able to get all our e-book content, CDL and vendor-licensed, in one place. We’ll add quality content we can’t get in any other way at no cost other than storing our relevant print copies, ultimately expanding our offerings by thousands of titles. Our book hungry patrons will be much more likely to find a great title they want while they wait for the best sellers we can license.”

Posted on October 6, 2019 by Wendy Hanamura

Our job is providing ‘Universal Access to All Knowledge.’
Knowledge comes from many places.
Explore.Enjoy. Leave your mark.
Brewster Kahle,
Founder & Digital Librarian, Internet Archive

2019 | Internet Archive Blogs (42)

We invite you to join us for the Internet Archive’s biggest bash of the year:World Night Market, Wednesday, October 23, 5-10 PM. We’ll be closing the street and throwing a block party for our friends, neighbors and partners to celebrate our impact with partners around the world.

Get Your Tickets Now

When you arrive from5- 7 pm, we will give you your Passport to food trucks with our favoritefoods from Singapore to Mexico City to Delhi; beer & wines from around theworld; Lion Dancers and music, playful tattoos, plus hands-on demonstrations ofthe Internet Archive’s latest innovations and partnerships.

Stamp your Passport to Knowledge at these demo stations:

2019 | Internet Archive Blogs (43)
  • Put on a VR headset and play in a virtual Archive or try out vintage games
  • Find your earliest websites with the Wayback Machine’s latest tools
  • Scan a book on our Tabletop Scribe
  • Are you’re a vinyl lover? Come play with our latest tools to digitize & playback LPs
  • Search millions of hours of talk and news radio
  • Listen to tunes with our Music Player & get a guided tour of the new features
  • Search 1.6 million programs in our TV News Archive
  • Unlock the secrets of Balinese lontars—Palm Leaf manuscripts sometimes centuries old with the experts of PanLex
  • Meet our friends from Better World Books, one of the largest used booksellers in the world
  • Explore the history of the World Wide Web with Infomesh.org
  • Join the Open Libraries movement to provide online digital access to millions of books for billions of people
  • Learn how local librarians are preserving their community’s online history with Archive-It
  • Try out the “Internet Archive in a Box”—our decentralized, portable, offline version perfect for remote locations
  • Save a Book with OpenLibrary.org
  • Watch a new documentary about the creator of KidPix
  • And much more!
2019 | Internet Archive Blogs (44)
2019 | Internet Archive Blogs (45)

Then from 7-8 PM the Great Room program begins! In a world in which truth seems to be fracturing, what’s a library’s role? To weave the trusted knowledge held by libraries into the World Wide Web itself. We’ve invited our partners and builders to share their herculean efforts to make media more accessible and reliable than ever.

You won’t want to miss:

2019 | Internet Archive Blogs (46)
  • InformationActivist, Carl Malamud on freeing the information of India
  • OpenAccess visionary, Lisa Petrides on building an diverse, inclusive, and equitableUniversal K-12 School Library for all
  • InternetArchive’s Alexis Rossi & Jason Buckner on making talk & news radiosearchable, comparable and ultimately, accountable
  • BrewsterKahle on our project with WikimediaFoundation to take readers deeper and ensure the integrity of the world’sonline encyclopedia
  • Plus theInternet Archive Hero Award and amajor announcement about our future direction

And after the program, be sure to stay for the dancing, DJs and dessert on our side patio.

2019 | Internet Archive Blogs (47)

Having knowledge you can trust has never been more important. So let’s celebrate— get your passport now to the World Night Market!

Get your Tickets Here

Posted on October 3, 2019 by Lila Bailey

2019 | Internet Archive Blogs (48)

Technology is enabling libraries in Canada to promote diversity, safeguard historic documents, and expand access — all while helping to save the planet.

The Hamilton Public Library in the Canadian province of Ontario has nearly two dozen branches. Providing digital content to users in geographically remote areas is one of many reasons that the library has recently embraced Controlled Digital Lending, the digital equivalent of traditional library lending.

2019 | Internet Archive Blogs (49)

“It’s such an environmentally friendly, cost effective way of making titles available,” said Paul Takala, CEO/Chief Librarian of the library. “If we digitize it, somebody doesn’t have to go to the library to get it and we don’t have to ship books around. It just makes a lot of sense.”

The library also has rare and fragile Canadiana content that is not available anywhere else or able to be physically loaned out. This includes history of the local area, land documents, first-hand accounts of settlers, a large collection of photographs and a unique collection of works published in the 19th and early 20th centuries.

2019 | Internet Archive Blogs (50)

“Now we have the technology to share so many stories from so many voices through this platform to anybody 24/7, ” said Lisa Weaver, Director of Collections & Program Development at the library. “The preservation of books that CDL allows us to do and access that CDL allows us to provide is invaluable.”

When Hamilton joined Open Libraries, it was able to identify 53,000 books in its physical holdings that Internet Archive had already digitized. Those books were added to Open Libraries to increase lending counts for those titles. For example, the library digitized three titles that cover unique pieces of Canadian history: “The Trail of the Black Helmut” by G. Elmore Reamon (1957), “The Art of Northwest Coast Indians” by Robert Bruce Inverarity (1950) and “The Clockmaker” by Thomas Haliburton (1958), the first internationally best-selling author of fiction from what is now Canada. With the assistance of Internet Archive, Hamilton will later this year accelerate its scanning of older titles and some of its unique Canadiana collection to share beyond the library walls.

Researchers and genealogists have been particularly interested in discovering the digitized material. The new format allows users to access resources when they wish, during their commute, wherever they are, or even when the library is not physically open. The program also helps students who want to read classics that are not in copyright and now widely available.

“CDL helps us provide access to the broadest number of resources to the broadest number of Canadians,” Weaver said. “Having books in digital format also supports customers with print disabilities access the content.”

As more libraries partner with Internet Archive to make their collections available via CDL, more will be giving back and adding to the shared collection. “Part of the mission of public libraries is to educate residents about the history and richness of their communities,” said Takala. “It’s about making more items available to our customers. The benefits are clear.”

2019 | Internet Archive Blogs (2024)
Top Articles
Latest Posts
Article information

Author: Greg Kuvalis

Last Updated:

Views: 6038

Rating: 4.4 / 5 (55 voted)

Reviews: 94% of readers found this page helpful

Author information

Name: Greg Kuvalis

Birthday: 1996-12-20

Address: 53157 Trantow Inlet, Townemouth, FL 92564-0267

Phone: +68218650356656

Job: IT Representative

Hobby: Knitting, Amateur radio, Skiing, Running, Mountain biking, Slacklining, Electronics

Introduction: My name is Greg Kuvalis, I am a witty, spotless, beautiful, charming, delightful, thankful, beautiful person who loves writing and wants to share my knowledge and understanding with you.