Message tei:1 - Read From: Text Encoding Initiative public discussion list Authorizing-Users: Lou Burnard To: Multiple recipients of list TEI-L [non receipt] Reply-To: Lou Burnard Subject: TEI for dummies? RFC-822-Headers: Content-transfer-encoding: 7BIT X-cc: lou@vax.ox.ac.uk Gloria McMillan's request for an introductory guide to the TEI scheme, complete with examples of how to use it in real life, is one we have been working on ways of satisfying for a while. Michael and I have both been giving introductory tutorials (both separately and jointly) at various venues using the same simplified version of the full TEI dtd for the last year or so... We're getting to the point where we're reasonably sure of the subset of the TEI dtd which such tutorials should define (though we're less certain what to call it -- current candidates are "TEI-Lite" and the "TEI Starter Set") . But the document we're currently working on so far has very little description of how to run sgml parsers, what sgml software does etc., and certainly no 'step-by-step' instructions of the kind Gloria seems to be asking for. Partly, that's because we try to provide such instruction as part of the live hands-on section of the Workshops where we've been using the document -- it's a lot easier to do that kind of instruction in person! Also, it's quite difficult to produce step by step instructions for a generic tutorial: one thing we have learned is that people learn a lot quicker if applications and examples are drawn from a field they are familiar with, and feel to be of importance and interest to them. This makes the general all-purpose document hard to write. It would be very useful to hear from readers of this list what kind of content they'd expect/desire to find in an introductory tutorial. Is there a general expectation that it should discuss matters like how to run an SGML parser, what kinds of applications exist, instructions on how to run some of them (and if so, which?) etc? If you'd like to check the current draft (bearing in mind that it is only a preliminary draft) TEI document TEIU5 is available from the usual places and we'd welcome comments on it from anyone who has time to read it, either directly or to the list. Lou Burnard (lou.burnard@oucs.ox.ac.uk) pp Michael SpMcQ (tei@uic.edu) who is not yet back from the far east Message tei:2 - Read From: Text Encoding Initiative public discussion list Authorizing-Users: Steve DeRose To: Multiple recipients of list TEI-L [non receipt] Reply-To: Steve DeRose Subject: Re: OSF's tpt package RFC-822-Headers: Content-transfer-encoding: 7BIT (Steve DeRose) For getting "out" of SGML, another tool you should look at is the SGML Hammer from SoftQuad (be *sure* to tell them I sent you when you call them). Last I remember it was a few hundred $, and SoftQuad at least used to have good academic discounts. Message tei:3 - Read From: Text Encoding Initiative public discussion list Authorizing-Users: Tobias Rischer To: Multiple recipients of list TEI-L [non receipt] Reply-To: Tobias Rischer Subject: OSF's tpt package RFC-822-Headers: Content-transfer-encoding: 7BIT I thought it was just me who had missed the information in the sgml newsgroup, but I got several requests, so here it is, for everybody: ftp grabbag.osf.org username: osfdtd password: osf8879 cd /osfdtd get TPT.tar.Z.uue To process the examples and documentation, you will also need the osf-dtd, it lies in the same directory. I had problems exporting to LaTeX and printed out the manualpages instead. The conversion process for the documentation is driven by a makefile, it worked only with gmake on my machine. I haven't done much more than looking at the doc's for `instant', the conversion program, and `transpecs', the specification format for the conversion. The transpecs files coming with the package also give an impression of what can be done and how. I have been pointed to perl and awk as possible alternatives; what I appreciate with the `instant' program is the SGML awareness that might make the processing of massive tagging more natural. Perhaps, `instant's role is that of an intermediate filter that reduces the massive information in sgmls output to what you really care about, and what you then want to process with awk, perl, etc.... Thanks for all replies -- Tobias -- ........................................... : [i Tromsoe til ca. 6'95] : : :-) : : Tobias Rischer : : Tunveien 9 A21 : : 9018 TROMSOE : : NORGE : : : : email: : : tobiasr@stud.cs.uit.no : : (rischer@informatik.tu-muenchen.de) : :.........................................: Message tei:9 - New, Unread From: Text Encoding Initiative public discussion list Authorizing-Users: christian wittern To: Multiple recipients of list TEI-L [non receipt] Reply-To: christian wittern Subject: Re: TEI for dummies? RFC-822-Headers: Content-transfer-encoding: 7BIT > Partly, that's because we try to provide such instruction as part of > the live hands-on section of the Workshops where we've been using the > document -- it's a lot easier to do that kind of instruction in person! > Also, it's quite difficult to produce step by step instructions for a > generic tutorial: one thing we have learned is that people learn a lot > quicker if applications and examples are drawn from a field they are > familiar with, and feel to be of importance and interest to them. This > makes the general all-purpose document hard to write. I think a step by step introduction should only be a short general prelude to more specialised introductions, nevertheless they certainly are necessary, IMHO for two reasons: 1. It seems that many people, who get attracted by TEI are new to SGML as well. It would certainly help to have a guided and safe way for the first few steps. 2. TEI is a rather complex SGML application, which requires some adaption of generic SGML tools. One example for this is the NAMELEN, on which my version of SGMSL still chokes. (I believe it can be adapted, but don't know how: This is exactly what I would like to read in a Gently Introduction to using TEI. Christian Wittern, Kyoto Lou Burnard (lou.burnard@oucs.ox.ac.uk) pp Michael SpMcQ (tei@uic.edu) who is not yet back from the far east Message tei:10 - New, Unread From: Text Encoding Initiative public discussion list Authorizing-Users: Henning Lobin To: Multiple recipients of list TEI-L [non receipt] Reply-To: Henning Lobin Subject: minimal changes to the SGML declaration RFC-822-Headers: Content-transfer-encoding: 7BIT Can anybody tell me, how a SGML declaration has to look like if I only want to make few simple changes to the concrete syntax. I want to use names with a length up to 20 characters and I want to have up to 55 tokens in an attribute group. I have lots of SGML material but I simply cannot figure out how the complete SGML declaration looks like with the two described changes. I want to use this alternative SGML declaration in connextion with the SoftQuad RulesBuilder. For any kind of help I would be very happy. Henning Lobin University of Bielefeld Germany Message tei:11 - New, Unread From: Text Encoding Initiative public discussion list Authorizing-Users: Gloria McMillan To: Multiple recipients of list TEI-L [non receipt] Reply-To: Gloria McMillan Subject: Are HTML and SGML compatible? RFC-822-Headers: Content-transfer-encoding: 7BIT The computer wiz at my college also thinks it would be most helpful to her to find out just how compatible HTML and SGML are. Can anyone out there tell me/her? Thanks, Gloria ----- Gloria McMillan Adjunct Faculty, Pima Community College EMail: GMcMillan@East.Pima.EDU Writing Department Message tei:12 - New, Unread From: Text Encoding Initiative public discussion list Authorizing-Users: Susan Parkinson-Wiberg To: Multiple recipients of list TEI-L [non receipt] Reply-To: Susan Parkinson-Wiberg Subject: Cultural Heritage Crucial to Development of NII RFC-822-Headers: Content-transfer-encoding: 7BIT THIS ANNOUNCEMENT IS BEING POSTED TO MULTIPLE ARTS AND HUMANITIES LISTS. PLEASE EXCUSE THE DUPLICATION PLEASE REPOST FREELY. The Getty Art History Information Program The American Council of Learned Societies The Coalition for Networked Information *ANNOUNCEMENT* Date: October, 1994 For Immediate Release: CULTURAL HERITAGE CRUCIAL TO DEVELOPMENT OF INFORMATION SUPERHIGHWAYS SAY NATIONAL LEADERS IN NEW REPORT Washington, DC -- An alliance of major humanities and arts organizations has issued a report calling on the federal government to recognize the crucial value of the American people's cultural heritage in planning the National Information Infrastructure (NII). "Humanities and Arts on the Information Highways - a Profile," coincides with the release of a related position paper by the White House Information Infrastructure Task Force at a Council on Competitiveness conference (September 7-8, 1994). The federal paper, "The Information Infrastructure: Reaching Society's Goals," presents strategies for applying information technologies to the cultural sector. The alliance, led by the Getty Art History Information program, the American Council of Learned Societies, and the Coalition for Networked Information, has issued its "Profile" to stimulate and guide response to the Administration's call for public comment. "The coming information highway has the potential to revolutionize the way we study and appreciate our cultural heritage," said Eleanor Fink, director of the Getty Art History Information Program. "Making the complex resources of the humanities and arts accessible on the networks would contribute profoundly to the most difficult technological challenges in computing." The "Profile" is the result of an ongoing process aimed at meeting the urgent need for the humanities and arts to gain a voice in the planning and development of the NII. Created by working groups of nationally recognized experts, the report defines the special challenges and opportunities associated with digitizing cultural heritage information. The "Profile" highlights ways in which a fully interactive and exploratory environment would transform the NII from a link between computers to a link between people. Speaking for the American Council of Learned Societies, president Stanley Katz observed that "Electronic technologies have the potential to transform information from a scarce, inequitably distributed and fragmented commodity into a true public good." Paul Evan Peters, executive director of the Coalition for Networked Information, expressed the hope that "our community-based process -- the "Profile" being the first outcome -- will generate the type of lively and open exchange of visions and needs that the Clinton administration is seeking." With the completion of this landmark document, the alliance plans to expand its base of organizational partners to encompass the full breadth of the cultural heritage sector. # # # (Note: electronic copies of the Profile are available upon request from the Coalition for Networked Information. Electronic copy can be accessed via FTP, Gopher, or the World Wide Web [see below]. Hard copies are available free of charge from Publications, Getty AHIP, 401 Wilshire Boulevard, Suite 1100, Santa Monica, CA 90401 or . Editors, please note the attached description of the Profile sponsors and names of individuals available for interviews or comment.) To access via FTP URL: ftp://ftp.cni.org/CNI/projects/humartiway/humartiway-rpt.Word.hqx URL: ftp://ftp.cni.org/CNI/projects/humartiway/humartiway-rpt.ps URL: ftp://ftp.cni.org/CNI/projects/humartiway/humartiway-rpt.txt ftp ftp.cni.org login: anonymous [send e-mail address as password] cd /CNI/projects/humartiway get humartiway-rpt.Word.hqx [Microsoft Word - BinHexed] get humartiway-rpt.ps [Postscript] get humartiway-rpt.txt [ASCII text] To access via Gopher URL: gopher://gopher.cni.org:70/00/cniftp/projects/humartiway/humartiway-rpt.Word.hq x URL: gopher://gopher.cni.org:70/00/cniftp/projects/humartiway/humartiway-rpt.ps URL: gopher://gopher.cni.org:70/00/cniftp/projects/humartiway/humartiway-rpt.txt gopher gopher.cni.org Coalition FTP Archives (ftp.cni.org) Coalition Projects (/CNI/projects) Humanities and Arts on the Information Highway (Final Report) To access via WWW URL: http://www.cni.org/docs/humartiway/www/humartiway-rpt.intro.html # # # Sponsors of "Humanities and Arts on the Information Highways - a Profile" The Getty Art History Information Program The Getty Art History Information Program (AHIP), one of six operating programs of the J. Paul Getty Trust, seeks to make art- historical information more accessible to scholars and researchers through the use of advanced computer technology. It does so by promoting common perspectives and standards among international institutions and organizations on projects in four general areas: working to affect policies that will enhance access to electronic information; coordinating vocabularies to facilitate consistent data entry and retrieval; providing bibliographic services; and assembling art historical databases. Among AHIPUs projects are the Art & Architecture Thesaurus, the Art Information Task Force, the Image and Information Standards Initiative, the Bibliography of the History of Art, the Avery Index to Architectural Periodicals, the Provenance Documentation Collaborative, the Witt Computer Index, the Union List of Artist Names, and the Thesaurus of Geographic Names. The Getty Art History Information Program, Director, Eleanor Fink, 401 Wilshire Boulevard, Suite 1100, Santa Monica, California 90401-1455, (310) 395-1025, ext. 1164, (310) 451-5570 fax, e-mail: efink@getty.edu The American Council of Learned Societies The American Council of Learned Societies (ACLS) is a private non-profit federation of 52 national scholarly organizations. The purpose of the Council, as set forth in its constitution, is "the advancement of humanities studies and the maintenance and strengthening of relations among the national societies devoted to such studies." Included in the program of the Council are awards to individual scholars to advance research in the humanities and humanistic aspects of the social sciences, support for international scholarly research and exchanges; activities concerned with the identification of present and future needs of humanistic scholarship, and planning and development to meet these needs; and organizational functions. In addition, the Council has fiscal and administrative oversight for the Council for International Exchange of Scholars (CIES), which administers the Fulbright program. The American Council of Learned Societies, President, Stanley Katz, 228 East 45th Street, New York, NY 10017-3398, (212) 697-1505, (212) 948-8058 fax, e-mail: snkatz@pucc.princeton.edu The Coalition for Networked Information The Coalition for Networked Information was founded in March 1990 to help realize the promise of advanced networks and high-performance computing for information access and delivery. The Coalition was established by three associations: The Association of Research Libraries (ARL), CAUSE and EDUCOM. ARL is an association promoting equitable access and effective use of recorded knowledge supporting teaching, research and scholarship. CAUSE and EDUCOM are dedicated to introducing, using and managing information technology and related sources in research in general and higher education. The Coalition of Networked Information promotes the creation of access to information resources in networked environments in order to enrich scholarship and enhance intellectual productivity. A Task Force of institutions and organizations able and willing to contribute resources and attention to the mission of the Coalition was created in 1990 and continues to grow. This Task Force now provides a common vehicle by which more than 200 institutions and organizations pursue a shared vision of information management and how it must change in the 1990s to meet the social, educational and economic opportunities and challenges of the 21st century. Members of the Task Force include higher education institutions, publishers, network service providers, computer hardware, software, and systems companies, library networks and organizations, and public and state libraries. The Coalition for Networked Information, Executive Director, Paul Evan Peters, 21 Dupont Circle, N.W., Washington, D.C. 20036, (202) 296-5098, (202) 872-0884 fax, e-mail: paul@cni.org Message tei:13 - New, Unread From: Text Encoding Initiative public discussion list Authorizing-Users: Gloria McMillan To: Multiple recipients of list TEI-L [non receipt] Reply-To: Gloria McMillan Subject: Info on automated tagging: TEI RFC-822-Headers: Content-transfer-encoding: 7BIT Hi! I was just talking to Cindy Dooling over at our computer dep't. and she suggested that I ask you this: Can anyone give info, which I might pass along to her about what is available and is possible to run over a VAX that will make TEI tags automatically? Cindy gave for an analogy the simple commands in Word Perfect that do bold w/o making the user say complex programming instructions, or the macros... What I am saying is that she is very interested to hear what might be out there that will semi-automate this whole, elaborate tagging process. It may be that there is no such animal, but it never hurts to ask... Thanks in advance!! Gloria ----- Gloria McMillan Adjunct Faculty, Pima Community College EMail: GMcMillan@East.Pima.EDU Writing Department Message tei:14 - New, Unread From: Text Encoding Initiative public discussion list Authorizing-Users: Stefan Heymer To: Multiple recipients of list TEI-L [non receipt] Reply-To: Stefan Heymer In-Reply-To: (message from Gloria McMillan on Thu, 20 Oct 1994 19:35:09 CDT)(060)9410210113.AA03344(a)informatik.uni-hildesheim.de(0 62) Subject: Re: Are HTML and SGML compatible? RFC-822-Headers: Content-transfer-encoding: 7BIT X-cc: TEI-L%UICVM.BITNET@vm.gmd.de Dear Gloria, HTML is just an application of SGML, i.e. a so called "Document Type Definition", short DTD. So HTML document instants can be parsed by almost any SGML validator (eg. SGMLS or YASP, the last of which I haven't had the opportunity to try). Most HTML viewers (as NCSA Mosaic and Chimera) aren't real SGML systems, so they'll happily ignore all tags of SGML that they don't know. Hope I've been helpful Stefan Heymer ------------------------------------------------------------------------------- Stefan Heymer heymer@informatik.uni-hildesheim.de University of Hildesheim Germany Message tei:15 - New, Unread From: Text Encoding Initiative public discussion list Authorizing-Users: Richard Vacca To: Multiple recipients of list TEI-L [non receipt] Reply-To: Richard Vacca Subject: SGML continuing education programs RFC-822-Headers: Content-transfer-encoding: 7BIT This announcement is being posted to both tei-l and sgml-l. Sorry for any duplication. Please distribute freely. The University of Wisconsin Dept of Engineering Professional Development offers these SGML-oriented continuing education courses, to be held in Madison, Wisconsin, during 1995. Building a Document Database with SGML January 23-25, 1995 June 19-21, 1995 This program assumes some knowledge of SGML or other markup scheme, and some knowledge of document management and textbases. It is aimed at DB and MIS managers, documentation managers, systems analysts. The instructors are Dale Waldt of Thomson Professional Publishing and Brian Travis of Information Architects, Inc. The fee for this three day course is $845. Creating Structured Documents February 27-28, 1995 June 15-16, 1995 This program is aimed primarily at publications staffs in organizations that produce manuals, catalogs, books, and online documents; also at managers who need to evaluate SGML and the structured document environment. The instructor is technical publishhing consultant Liora Alschuler. The fee for this two-day course is $595. For more information, complete course descriptions, or registration information, contact Dick Vacca (dvacca@macc.wisc.edu) or Tom Smith (smithtw@engr.wisc.edu). --Dick Vacca 608-246-0219 Internet: dvacca@macc.wisc.edu America Online: richv001 Message tei:16 - New, Unread From: Text Encoding Initiative public discussion list Authorizing-Users: Syd Bauman To: Multiple recipients of list TEI-L [non receipt] Reply-To: Syd Bauman Subject: Re: minimal changes to the SGML declaration RFC-822-Headers: Content-transfer-encoding: 7BIT Response to "minimal changes to the SGML declaration" posted on 20 Oct 1994 at 13:55 CDT by Henning Lobin > Can anybody tell me, how a SGML declaration has to look like if I > only want to make few simple changes to the concrete syntax. ..... > with the SoftQuad RulesBuilder. One method is to get Author/Editor to print out a copy of its default SGM declaration (using the Export... feature), and then make changes to that declaration. Of course, if you're doing TEI or TEI-lilke stuff, you may want to start with "sgmldecl.tei", which is supplied along with the DTDs. > I want to use names with a length up to 20 characters and I want to > have up to 55 tokens in an attribute group. (I'm betting you mean the number of names and name tokens in an attribute definition list.) You would want to add the following lines immediately after the line "QUANTITY SGMLREF": ATTCNT 55 NAMELEN 20 In the case of starting with the TEI's sgmldecl.tei, you would then want to delete their "NAMELEN 32". > I have lots of SGML material but I simply cannot figure out how the > complete SGML declaration looks like with the two described > changes. I think SGML declarations can be enormously difficult to learn (I don't pretend to even understand large chunks of them). One good place to start is the paper "Understanding the SGML Declaration" by Exoterica Corporation. Last I knew it was a copyrighted paper not available in electronic form, but Exoterica would send a hard copy just for the asking. -- Syd Bauman, Programmer/Analyst Brown University Women Writers Project Syd_Bauman@Brown.edu Message tei:17 - New, Unread From: Text Encoding Initiative public discussion list Authorizing-Users: David Megginson To: Multiple recipients of list TEI-L [non receipt] Reply-To: David Megginson In-Reply-To: (message from Gloria McMillan on Fri, 21 Oct 1994 15:23:41 CDT)(060)9410220507.AA16538(a)aix1.uottawa.CA(062) Subject: Re: Info on automated tagging: TEI RFC-822-Headers: Content-transfer-encoding: 7BIT X-cc: TEI-L%UICVM.bitnet@acadvm1.uottawa.ca >>>>> "Gloria" == Gloria McMillan writes: > Hi! I was just talking to Cindy Dooling over at our computer > dep't. and she suggested that I ask you this: > Can anyone give info, which I might pass along to her about > what is available and is possible to run over a VAX that will make > TEI tags automatically? Cindy gave for an analogy the simple > commands in Word Perfect that do bold w/o making the user say > complex programming instructions, or the macros... Are you tagging an existing text, or creating a new one? For creating a new text, the best free product available is PSGML, a fairly complete SGML editor running under Gnu Emacs. I have used PSGML extensively with the TEI DTD and have come to love its ability to insert the right tag in the right place. On the WWW, you can find information on the PSGML Home Page: http://www.lysator.liu.se/projects/about_psgml or you can download the program and documentation directly by anonymous FTP: ftp://ftp.lysator.liu.se/pub/sgml/psgml-1a3.tar.gz Note that for either of these, you require _at least_ version 19.19 of Gnu Emacs (the version numbers for XEmacs, nee Lucid, are different). You did not specify whether the VAX is running Unix or VMS. If it is running VMS, you should fire your computing staff :-); if it is running Unix, there are also commercial editors available which are faster and offer more bells and whistles -- you can find a fairly complete list by anon. FTP at ftp://ifi.uio.no/pub/SGML/SGML-Tools/SGML-Tools.txt (The same directory also has Postscript versions for several page formats). One of the most popular products, and also one of the earliest in the market, is Author/Editor from SoftQuad, which also has a range of other SGML-related products (I do not know which unices it runs on, however): SoftQuad Inc. 56 Aberfoyle Crescent, Suite 810 Toronto, Ontario M8X 2W4 Canada Tel: +1 (416) 239-4801 Fax: +1 (416) 239-7105 If you are adding markup to an existing text, you need to take a completely different approach. A good hacker can use a text-processing language like perl to do most of the work, but for a dedicated SGML system, I have been hearing good things about OmniMark from Exoterica, here in Ottawa (again, it might not run under your Unix): Exoterica Corp. 1545 Carling Avenue, Suite 404 Ottawa, Ontario K1Z 8P9 Canada Tel: +1 (613) 722-1700 Fax: +1 (613) 722-5706 Email: info@exoterica.com Do not by any means limit your search to these products, but give the list at ifi.uio.no a good and careful reading. David --- David Megginson Department of English, University of Ottawa, dmeggins@aix1.uottawa.ca Ottawa, Ontario, CANADA K1N 6N5 dmeggins@acadvm1.uottawa.ca Phone: (613) 564-6850 (Office) ak117@freenet.carleton.ca (613) 564-9175 (FAX) Message tei:18 - New, Unread From: Text Encoding Initiative public discussion list Authorizing-Users: Ole Norling-Christensen To: Multiple recipients of list TEI-L [non receipt] Reply-To: In-Reply-To: (059) from (q)Gloria McMillan(q)(060)no.id(062) Subject: Re: Info on automated tagging: TEI RFC-822-Headers: Content-transfer-encoding: 7BIT Gloria McMillan asks > what is available and is possible to run over a VAX that will > make TEI tags automatically? > DIPA is a fast and reliable parser that we developped in order to tag dictonary entries. Several printed dictionaries have been converted into SGML-files by this DIctionary PArser (e.g. for making new editions in the editing SGML-tool GestorLEX, or for extracting distinct information types for reuse). Of course, the input need not to be a dictionary, but it must be "dictionary like" in that it consists of not too big subdocuments (like dictionary entries) that can be parsed one at a time. As its input it takes: - a "grammar": a set of rewriting rules that define the structure which is to be made explicit by SGML-tags; three types of "leaves" are accepted: character sets (e.g. letters of an alphabet); concrete strings (e.g. different kinds of interpunction); and references to the "lexicon" - "lexicon" of classes of concrete strings, e.g. the dictionary's abbreviations for p.o.s., subject and register labels, etc. - the text to be syntax-checked and/or tagged. Output: - a file of all accepted entries - a file of the non-accepted entries, with a marking of the place in the text where the parser gave up its analysis (and the reason why) - an sgml-tagged version of the accepted entries You can run it in batch mode or interactively. In the latter case, any parsing error will cause a menu that gives the possibility of editing the grammar, the lexicon or the entry. For practical use, most input files have to be preprocessed in order to adjust tags/codes, and perhaps character set. In some cases, also some postprocessing of the tagged file may be wanted. For the time being, DIPA is used under DOS and UNIX; it is written in Microsoft C, so it should be possible to recompile it for use on e.g. a VAX. For further information, please contact me at this address: ***************************************** * Ole Norling-Christensen * * olenc@coco.ihi.ku.dk * * * * The Danish Dictionary, KUA * * Njalsgade 80, DK-2300 Copenhagen S * * Tel +45 3532 8995 - Fax +45 3154 2595 * *****************************************