Rare Book Monthly

Articles - August - 2019 Issue

A Deep-dive Database of Local History, Attitudes, and Ideas

Ulster County documents

Recently I purchased a small group of mid-Hudson Valley material that I found useful as examples of what would logically be included in a deep-dive experimental database for the New York State counties mid-way between New York City and Albany.  A what?  A deep-dive-database is a full text searchable database, something like what Google does in its Books section.  Whether it is a d3 or a FTSD or something else remains to be seen, but it is the future.

 

Databases of the printed word have generally been confined to brief descriptions and details of books and printed documents.  To see an actual copy, for example if you are using the OCLC, you are provided locations where such copies, physical and electronic, are found.   On RBH we focus on auction records and dealer descriptions to illuminate the emerging understanding of an example’s importance and value.  Such databases are potentially very large as ours is, more than 9 million full text records. 

 

But what is now emerging are full text databases.  That is, they capture the complete contents of a document in word searchable form, not only as a scan but as a word document.  Some efforts currently look for references in text but they have generally been dull instruments, in some cases because the references need to be dug out and in others because they are behind paywalls.    This will change and with this change there will be full text readable versions searchable online – and in many cases, searchable for free.

 

This experimental free database for the mid-Hudson Valley will include the standard reference materials, town and county histories, maps that convey changes, appropriate books by local authors, broadsides, pamphlets and ephemera – all in full searchable text.

 

The search will be different because the most common form posted will be ephemera that will outnumber books and pamphlets somewhere between a thousand and ten thousand to one.

 

Books usually include the title, author, publisher/printer, place and date printed.  When even one of these facts is missing it can complicate searches.  For ephemera you might be lucky to have three of these factors.  The others will require associated factors such as “they are among a group of letters in the same hand”.  Here’s an example.  A collection of letters from A.M. to B. R.  dated by day and month but not by year.  However, one envelope is dated 1863 and the events mentioned suggest the Battle at Chancellorsville.  Can this be figured out?  Probably.  As this example suggests, judgments will be made.

 

Here are some of the fields needed to identify and contextualize such letters.

 

Date or date range stated or implied

Names implied or known

Subject[s] such as events and places

Regimental references and information including cross-references

 

In addition, other fields will sometimes play a part:

 

Watermarks

Context of the document [among a group of similar items or with other related materials]

 

References gleaned from genealogical sites

 

References from online searches on Google and others

 

Altogether it will often, but not always, be possible to contextualize material, thus creating a deeper perspective – a perspective I believe that will change our understanding of the past.

 

Here are some other examples:  Ulster Mine at Ellenville, Ulster County, New York, a series of 5 printed documents, many with illustrations, that relate to this mine from 1852 to 1855 that include:

 

A 16 page report dated July 1st, 1852

 

An abbreviated broadside version dated July 1st, 1852

 

A 12 page report dated December 10th, 1852

 

A broadside, brief financial statement dated 15th December, 1852

 

A 16 page report dated January 3, 1854 titled Official Reports of the Ulster Company for the year 1853

 

This mine was located a short distance from the Delaware & Hudson Canal and was opened in 1852 during a period when Americans were looking everywhere for gold because of the stories emerging about the gold strikes in California.  In Ellenville they found lead while in Kingston some 20 miles away they believed they found gold that, when assayed, turned out to be pyrite or fool’s gold.  Such documents are so much more interesting than a title, date, author and print date.

 

Among the other documents I purchased is a stock receipt for the Hobart Branch Railroad Company signed by Thomas Cornell, who was a man of wealth whose steam boats coursed the Hudson River in the latter half of the 19th century.  He was based in Rondout but his influence reached in every direction.

 

Another is a menu for the Hotel Kaaterskill at Catskill for Thursday August 24, 1899.  Tastes have changed!

 

A small one is an 1857 7.625” x 5” broadside circular calling on teachers in Orange County to participate in a quarterly meeting to be instructed on new teaching approaches.  The teachers were expected to pay their own way but a handwritten note suggests the costs may be shared.

 

These are a few of the many documents that will contribute to an understanding of what life was like and altogether convey the changing assumptions and understanding people generally had.  Life has never been a paved highway and in the mid-Hudson Valley it seems more like a gravel path; every spec of gravel evidence of unique personal history.

 

An intensely focused, full text searchable database will bring these details to light.

 

Images of some of the examples are included with this article. 


Posted On: 2019-08-09 17:16
User Name: certainbooks

Hello Bruce: How would this proposed database differ from the current OCLC search fields, for instance? These search fields allow for choices in access method, accession number, author, author phrase, corporate or conference name, corporate and conference name phrase, personal name, personal name phrase, language type, material type, material type phrase and 18 more choices, per each search line - including a half-dozen under 'subject' alone. The search fields in OCLC offer these options, in three separate possible boxes, multiplying the search-ability by all those permutations. Additionally, there are year date, language and number of libraries searches as separate boxes. Limitation fields below go even further and allow for type of material: books, visual materials, computer files, internet resources, serial publications, sound recordings, archival materials, continually updated resources, articles, musical scores, maps allow for a narrowing of the field of search even further. There are additional limitations for availability possibilities too. Sincerely, George Krzyminski at Certain Books


Posted On: 2019-08-10 18:17
User Name: adminb

The OCLC, which I use but may not fully understand, shows how many copies are held among the more than 30,000 members of OCLC. So, for example I looked up “Art Work of Ulster County” recently and found 5 locations: LOC, NYPL, SUNY New Paltz, UCCC and Penn State. None of these copies are searchable online. Neither did I find it in Google Books.

To see the entire volume all pages including text and images will to be scanned and then converted into one or more word documents that random keywords searches can find. That’s the approach I’ll take to all material uploaded to this database.

In addition to books, all printed forms as well as manuscript material will be included.

This full text will be wide open to Google so that random terms and phrases found in this local database will create matches.

At a guess, and it’s strictly a guess, about 15% of the U. S. population has some connection to the mid-Hudson Valley.


Rare Book Monthly

  • ALDE, May 28: KIPLING (RUDYARD). Le Livre de la Jungle. – Le IIe livre de la Jungle. Paris, Sagittaire, Simon Kra, 1924-1925. €3,000 to €4,000.
    ALDE, May 28: NOAILLES (ANNA DE). Les Climats. Paris, Société du Livre contemporain, 1924. €50,000 to €60,000.
    ALDE, May 28: MILTON (JOHN). Paradis perdu. Quatrième chant. S.l., Les Bibliophiles de l'Automobile-Club de France, 1974. €2,000 to €3,000.
    ALDE, May 28: LEBEDEV (VLADIMIR). Russian Placards - Placard Russe 1917-1922. Saint-Petersbourg, Sterletz, 1923. €1,000 to €1,200.
    ALDE, May 28: MARDRUS (JOSEPH-CHARLES). Histoire charmante de l'adolescente sucre d'amour. Paris, F.-L. Schmied, 1927. €1,500 to €2,000.
    ALDE, May 28: TABLEAUX DE PARIS. Paris, Émile-Paul Frères, 1927. €2,000 to €3,000.
    ALDE, May 28: LA FONTAINE (JEAN DE). Les Fables illustrées par Paul Jouve. S.l. [Lausanne], Gonin & Cie, 1929. €4,000 to €5,000.
    ALDE, May 28: SARTRE (JEAN-PAUL). Vingt-deux dessins sur le thème du désir. Paris, Fernand Mourlot, 1961. €1,500 to €2,000.
    ALDE, May 28: [BRAQUE (GEORGES)]. 13 mai 1962. Alès, PAB, 1962. €3,000 to €4,000.
    ALDE, May 28: MIRÓ (JOAN). Je travaille comme un jardinier. Avant-propos d'Yvon Taillandier. Paris, Société intenationale d'art XXe siècle, 1963. €1,000 to €2,000.
    ALDE, May 28: MAGNAN (JEAN-MARIE). Taureaux. Paris, Michèle Trinckvel, 1965. €3,000 to €4,000.
    ALDE, May 28: PICASSO (PABLO). Dans l'atelier de Picasso. 1960. €15,000 to €20,000.
  • Sotheby’s
    Modern First Editions
    Available for Immediate Purchase
    Sotheby’s, Available Now: Winston Churchill. The Second World War. Set of First-Edition Volumes. 6,000 USD
    Sotheby’s, Available Now: A.A. Milne, Ernest H. Shepard. A Collection of The Pooh Books. Set of First-Editions. 18,600 USD
    Sotheby’s, Available Now: Salvador Dalí, Lewis Carroll. Alice's Adventures in Wonderland. Finely Bound and Signed Limited Edition. 15,000 USD
    Sotheby’s
    Modern First Editions
    Available for Immediate Purchase
    Sotheby’s, Available Now: Ian Fleming. Live and Let Die. First Edition. 9,500 USD
    Sotheby’s, Available Now: J.K. Rowling. Harry Potter Series. Finely Bound First Printing Set of Complete Series. 5,650 USD
    Sotheby’s, Available Now: Ernest Hemingway. A Farewell to Arms. First Edition, First Printing. 4,200 USD
  • Ketterer Rare Books
    Auction May 27th
    Ketterer Rare Books, May 27:
    K. Marx, Das Kapital,1867. Dedication copy. Est: € 120,000
    Ketterer Rare Books, May 27:
    Latin and French Book of Hours, around 1380. Est: € 25,000
    Ketterer Rare Books, May 27:
    Theodor de Bry, Indiae Orientalis, 1598-1625. Est: € 80,000
    Ketterer Rare Books
    Auction May 27th
    Ketterer Rare Books, May 27:
    Breviary, Latin manuscript, around 1450-75. Est: € 10,000
    Ketterer Rare Books, May 27:
    G. B. Piranesi, Vedute di Roma, 1748-69. Est: € 60,000
    Ketterer Rare Books, May 27:
    K. Schmidt-Rottluff, Arbeiter, 1921. Orig. watercolour on postcard. Est: € 18,000
    Ketterer Rare Books
    Auction May 27th
    Ketterer Rare Books, May 27:
    Breviarium Romanum, Latin manuscript, 1474. Est: € 20,000
    Ketterer Rare Books, May 27:
    C. J. Trew, Plantae selectae, 1750-73. Est: € 28,000
    Ketterer Rare Books, May 27:
    M. Beckmann, Apokalypse, 1943. Est: € 50,000
    Ketterer Rare Books
    Auction May 27th
    Ketterer Rare Books, May 27:
    Ulrich von Richenthal, Das Concilium, 1536. Est: € 9,000
    Ketterer Rare Books, May 27:
    I. Kant, Critik der reinen Vernunft, 1781. Est: €12,000
    Ketterer Rare Books, May 27:
    Arbeiter-Illustrierte Zeitung (AIZ) / Die Volks-Illustrierte (VI), 1932-38. Est: €8,000

Article Search

Archived Articles

Ask Questions