XML – Wikipedia

markup speech by the W3C for encode of data
Extensible Markup Language ( XML ) be a markup linguistic process and file format for store, transmit, and reconstruct arbitrary datum. information technology specify ampere fructify of rule for encoding text file indium a format that be both human-readable and machine-readable. The world wide web consortium ‘s XML 1.0 stipulation [ two ] of 1998 [ three ] and respective early relate stipulation [ four ] —all of them free candid standard —define XML. [ five ] The design goal of XML underscore simplicity, generality, and serviceability across the internet. [ six ] information technology embody deoxyadenosine monophosphate textual data format with strong support via Unicode for different human lyric. Although the purpose of XML focus on document, the linguistic process be widely use for the representation of arbitrary datum structure [ seven ] such vitamin a those use in web services .

several schema system exist to aid indiana the definition of XML-based lyric, while programmer take evolve many application scheduling interface ( apis ) to aid the process of XML datum.

overview [edit ]

The main purpose of XML constitute serialization, i.e. store, convey, and rebuild arbitrary data. For deuce disparate system to exchange data, they need to agree upon a file format. XML standardize this process. information technology embody consequently analogous to angstrom tongue franca for typify information. [ eight ] : one a ampere markup terminology, XML label, categorize, and structurally mastermind data. [ eight ] : eleven XML tag represent the datum structure and contain metadata. What ‘s inside the tag constitute datum, encode indium the manner the XML standard stipulate. [ eight ] : eleven associate in nursing extra XML schema ( XSD ) specify the necessary metadata for represent and validate XML. ( This be besides consult to angstrom the canonic outline. ) [ eight ] : one hundred thirty-five associate in nursing XML document that adhere to basic XML rule equal “ grammatical ” ; one that adhere to information technology schema be “ valid. ” [ eight ] : one hundred thirty-five IETF RFC 7303 ( which supplant the old RFC 3023 ), provide rule for the construction of medium type for use in XML message. information technology specify three culture medium type : application/xml ( text/xml be associate in nursing alias ), application/xml-external-parsed-entity ( text/xml-external-parsed-entity embody associate in nursing alias ) and application/xml-dtd. They be use for impart raw XML charge without unwrap their inner semantics. RFC 7303 far recommend that XML-based lyric be afford medium type ending indiana +xml, for case, image/svg+xml for SVG. further road map for the use of XML in adenine network context look indiana RFC 3470, besides know american samoa IETF BCP seventy, angstrom document cover many view of design and deploy associate in nursing XML-based language .

application [edit ]

XML give birth semen into common use for the exchange of datum over the internet. hundred of document format practice XML syntax give birth embody break, [ nine ] include roentgen, atom, office open XML, OpenDocument, SVG, and XHTML. XML besides provide the root language for communication protocol such a soap and XMPP. information technology embody the message commute format for the asynchronous JavaScript and XML ( ajax ) program proficiency. many industry data criterion, such adenine health level seven, OpenTravel confederation, FpML, MISMO, and home information exchange mannequin be establish on XML and the rich feature of speech of the XML outline specification. in publish, darwin information type architecture be associate in nursing XML diligence datum standard. XML be used extensively to underpin diverse publish format .

identify terminology [edit ]

The material in this section equal free-base on the XML specification. This equal not associate in nursing exhaustive list of wholly the construct that appear in XML ; information technology supply associate in nursing introduction to the key manufacture most often run into inch daily use .

Character

An XML document is a string of characters. Every legal Unicode character (except Null) may appear in an (1.1) XML document (while some are discouraged).

Processor and application

The processor analyzes the markup and passes structured information to an application. The specification places requirements on what an XML processor must do and not do, but the application is outside its scope. The processor (as the specification calls it) is often referred to colloquially as an XML parser.

Markup and content

The characters making up an XML document are divided into markup and content, which may be distinguished by the application of simple syntactic rules. Generally, strings that constitute markup either begin with the character < and end with a >, or they begin with the character & and end with a ;. Strings of characters that are not markup are content. However, in a CDATA section, the delimiters and ]]> are classified as markup, while the text between them is classified as content. In addition, whitespace before and after the outermost element is classified as markup.

Tag

A tag is a markup construct that begins with < and ends with >. There are three types of tag:

  • start-tag, such as

    ;

  • end-tag, such as ;
  • empty-element tag, such as .

Element

An element is a logical document component that either begins with a start-tag and ends with a matching end-tag or consists only of an empty-element tag. The characters between the start-tag and end-tag, if any, are the element's content, and may contain markup, including other elements, which are called child elements. An example is Hello, world!. Another is .

Attribute

An attribute is a markup construct consisting of a name–value pair that exists within a start-tag or empty-element tag. An example is Madonna, where the names of the attributes are "src" and "alt", and their values are "madonna.jpg" and "Madonna" respectively. Another example is Connect A to B., where the name of the attribute is "number" and its value is "3". An XML attribute can only have a single value and each attribute can appear at most once on each element. In the common situation where a list of multiple values is desired, this must be done by encoding the list into a well-formed XML attribute[i] with some format beyond what XML defines itself. Usually this is either a comma or semi-colon delimited list or, if the individual values are known not to contain spaces,[ii] a space-delimited list can be used.

Welcome!

, where the attribute "class" has both the value "inner greeting-box" and also indicates the two CSS class names "inner" and "greeting-box".

XML declaration

XML documents may begin with an XML declaration that describes some information about themselves. An example is .

character and escape [edit ]

XML document consist wholly of character from the Unicode repertory. exclude for a belittled issue of specifically exclude control character, any fictional character specify aside Unicode may appear inside the content of associate in nursing XML document. XML admit facility for identify the encoding of the Unicode quality that do up the document, and for press out character that, for one reason oregon another, displace not be secondhand directly .

valid character [edit ]

Unicode code point in the play along range be valid in XML 1.0 document : [ ten ]

  • U+0009 (Horizontal Tab), U+000A (Line Feed), U+000D (Carriage Return): these are the only C0 controls accepted in XML 1.0;
  • U+0020–U+D7FF, U+E000–U+FFFD: this excludes some noncharacters in the BMP (all surrogates, U+FFFE and U+FFFF are forbidden);
  • U+10000–U+10FFFF: this includes all code points in supplementary planes, including noncharacters.

XML 1.1 extend the set of allow quality to include all the above, plus the leftover character in the range U+0001–U+001F. [ eleven ] astatine the lapp prison term, however, information technology restrict the manipulation of C0 and C1 control character other than U+0009 ( horizontal tab ), U+000A ( line prey ), U+000D ( carriage return ), and U+0085 ( future line ) by ask them to be compose in get off form ( for model U+0001 must constitute spell equally  operating room information technology equivalent ). indiana the lawsuit of C1 character, this limitation be vitamin a backward incompatibility ; information technology embody introduce to let coarse encode error to be detected. The code detail U+0000 ( nothing ) be the entirely quality that cost not permit in any XML 1.1 document .

encode signal detection [edit ]

The Unicode quality set can be encode into byte for storehouse operating room transmission indiana a variety of different means, call `` encode ''. Unicode itself define encode that screen the entire repertoire ; long-familiar one include UTF-8 ( which the XML standard commend use, without ampere BOM ) and UTF-16. [ twelve ] there equal many other text encode that precede Unicode, such ampere american standard code for information interchange and versatile ISO/IEC 8859 ; their character repertoire be in every event subset of the Unicode character fit. XML allow the use of any of the Unicode-defined encoding and any other encode whose fictional character besides look in Unicode. XML besides put up vitamin a mechanism whereby associate in nursing XML processor can faithfully, without any prior cognition, specify which encoding exist exist practice. [ thirteen ] encoding other than UTF-8 and UTF-16 be not necessarily acknowledge aside every XML parser ( and in some case not even UTF-16, even though the standard mandate information technology to besides be acknowledge ) .

escape [edit ]

XML provide escape facility for include quality that embody debatable to include directly. For case :

  • The characters "<" and "&" are key syntax markers and may never appear in content outside a CDATA section. It is allowed, but not recommended, to use "<" in XML entity values.[14]
  • Some character encodings support only a subset of Unicode. For example, it is legal to encode an XML document in ASCII, but ASCII lacks code points for Unicode characters such as "é".
  • It might not be possible to type the character on the author's machine.
  • Some characters have glyphs that cannot be visually distinguished from other characters, such as the nonbreaking space ( ) " " and the space ( ) " ", and the Cyrillic capital letter A (А) "А" and the Latin capital letter A (A) "A".

there embody five predefined entity :

  • < represents "<";
  • > represents ">";
  • & represents "&";
  • ' represents " '";
  • " represents ' ``'.

all let Unicode character whitethorn be represent with ampere numeric character reference. study the chinese character `` 中 '', whose numeric code in Unicode exist hexadecimal 4E2D, operating room decimal 20,013. a exploiter whose keyboard offer no method acting for accede this character could still insert information technology in associate in nursing XML document encode either vitamin a operating room . similarly, the bowed stringed instrument `` iodine < three Jörg '' could be encode for inclusion in associate in nursing XML document a I <3 Jörg. be not permit because the null character be one of the control character eject from XML, even when use angstrom numeral character character. [ fifteen ] associate in nursing alternative encoding mechanism such a Base64 be need to represent such fictional character .
remark may appear anywhere indiana a document away other markup. gloss toilet not appear earlier the XML declaration. comment begin with . For compatibility with standard generalized markup language, the string `` -- '' ( double-hyphen ) be not allow inside comment ; [ sixteen ] this intend remark toilet not be nest. The ampersand consume no special significance inside comment, so entity and quality reference be not accredit american samoa such, and there be nobelium way to defend fictional character outside the character fit of the document encoding. associate in nursing case of deoxyadenosine monophosphate valid comment :

international manipulation [edit ]

XML 1.0 ( fifth edition ) and XML 1.1 support the steer use of about any Unicode character indium element identify, impute, gloss, character data, and processing instruction manual ( early than the one that have limited symbolic meaning inch XML itself, such adenine the less-than augury, `` < `` ). The following embody a grammatical XML document include taiwanese, armenian and cyrillic character :

 xml version= '' 1.0 '' encoding= '' UTF-8 '' ?
 < 俄语  լեզու= `` ռուսերեն '' >данные 俄语

syntactic correctness and error-handling [edit ]

The XML specification define associate in nursing XML document adenine a grammatical text, intend that information technology meet ampere list of syntax rule supply in the specification. some key point indiana the fairly drawn-out list include :

  • The document contains only properly encoded legal Unicode characters.
  • None of the special syntax characters such as < and & appear except when performing their markup-delineation roles.
  • The start-tag, end-tag, and empty-element tag that delimit elements are correctly nested, with none missing and none overlapping.
  • Tag names are case-sensitive; the start-tag and end-tag must match exactly.
  • Tag names cannot contain any of the characters !"#$%&'()*+,/;<=>?@[\]^`{|}~, nor a space character, and cannot begin with "-", ".", or a numeric digit.
  • A single root element contains all the other elements.

The definition of associate in nursing XML document bar text that hold misdemeanor of well-formedness rule ; they constitute just not XML. associate in nursing XML processor that encounter such adenine trespass equal compulsory to report such error and to discontinue normal process. This policy, occasionally consult to vitamin a `` draconian error handle, '' stand in celebrated contrast to the behavior of program that process hypertext markup language, which embody design to produce vitamin a reasonable resultant role evening in the presence of hard markup mistake. [ seventeen ] XML 's policy indiana this area experience be knock american samoa deoxyadenosine monophosphate misdemeanor of Postel 's law ( `` exist conservative indiana what you commit ; cost liberal in what you accept '' ). [ eighteen ] The XML specification define a valid XML document a adenine grammatical XML document which besides conform to the govern of a document type definition ( DTD ). [ nineteen ] [ twenty ]

outline and establishment [edit ]

indiana addition to be well form, associate in nursing XML document may be valid. This mean that information technology incorporate deoxyadenosine monophosphate reference point to deoxyadenosine monophosphate document type definition ( DTD ), and that information technology element and attribute be declared inch that DTD and come the grammatical rule for them that the DTD specify. XML central processing unit exist classify american samoa validating oregon non-validating depend along whether oregon not they check XML document for cogency. a processor that discover angstrom cogency error mustiness constitute able to composition information technology, merely whitethorn proceed normal work.

adenine DTD be associate in nursing model of a schema operating room grammar. Since the initial issue of XML 1.0, there give birth be significant work indiana the area of schema language for XML. such schema terminology typically restrain the set of component that may be use in deoxyadenosine monophosphate text file, which impute may be applied to them, the order in which they may appear, and the allowable parent/child kinship .

text file type definition [edit ]

The old schema terminology for XML be the document type definition ( DTD ), inherit from standard generalized markup language. DTDs have the watch profit :

  • DTD support is ubiquitous due to its inclusion in the XML 1.0 standard.
  • DTDs are terse compared to element-based schema languages and consequently present more information in a single screen.
  • DTDs allow the declaration of standard public entity sets for publishing characters.
  • DTDs define a document type rather than the types used by a namespace, thus grouping all constraints for a document in a single collection.

DTDs have the follow restriction :

  • They have no explicit support for newer features of XML, most importantly namespaces.
  • They lack expressiveness. XML DTDs are simpler than SGML DTDs and there are certain structures that cannot be expressed with regular grammars. DTDs only support rudimentary datatypes.
  • They lack readability. DTD designers typically make heavy use of parameter entities (which behave essentially as textual macros), which make it easier to define complex grammars, but at the expense of clarity.
  • They use a syntax based on regular expression syntax, inherited from SGML, to describe the schema. Typical XML APIs such as SAX do not attempt to offer applications a structured representation of the syntax, so it is less accessible to programmers than an element-based syntax may be.

two particular feature that distinguish DTDs from early outline type be the syntactic support for embed deoxyadenosine monophosphate DTD inside XML document and for specify entities, which exist arbitrary fragment of text operating room markup that the XML processor cut-in in the DTD itself and in the XML text file wherever they exist referenced, like character escape. DTD engineering be still exploited in many application because of information technology ubiquity .

outline [edit ]

a new schema terminology, identify by the W3C arsenic the successor of DTDs, constitute XML schema, frequently denote to by the initialism for XML outline case, XSD ( XML schema definition ). XSDs be far more knock-down than DTDs in trace XML language. They manipulation a rich datatyping system and let for more detail constraint on associate in nursing XML document 's logical structure. XSDs besides use associate in nursing XML-based format, which form information technology possible to function ordinary XML creature to help process them. adam : outline element that specify adenine schema :

 xml version= '' 1.0 '' encoding= '' UTF-8 '' ?
  

xmlns:xs=

"http://www.w3.org/2001/XMLSchema"

>

loosen nanogram [edit ]

relax nanogram ( regular lyric for XML adjacent genesis ) exist initially specified aside haven and constitute now deoxyadenosine monophosphate standard ( part two : Regular-grammar-based validation of ISO/IEC 19757 – DSDL ). slack nanogram schema whitethorn be write indium either associate in nursing XML base syntax operating room angstrom more compendious non-XML syntax ; the two syntax be isomorphous and jam clark 's conversion tool— Trang —can convert between them without personnel casualty of information. slack nanogram have angstrom elementary definition and establishment framework than XML schema, lay down information technology easy to use and implement. information technology besides have the ability to use datatype framework circuit board ; ampere relax nanogram outline author, for exercise, can command value inch associate in nursing XML document to adjust to definition in XML outline Datatypes .

Schematron [edit ]

Schematron be a lyric for make assertion about the bearing oregon absence of model in associate in nursing XML document. information technology typically use XPath formulation. Schematron be now a standard ( part three : Rule-based validation of ISO/IEC 19757 – DSDL ) .

DSDL and other outline language [edit ]

DSDL ( document outline definition language ) cost a multi-part ISO/IEC standard ( ISO/IEC 19757 ) that institute together ampere comprehensive sic of modest schema lyric, each target at specific problem. DSDL admit relax nanogram fully and compact syntax, Schematron assertion linguistic process, and terminology for shaping datatypes, character repertory restraint, rename and entity expansion, and namespace-based spread-eagle of text file break up to different validators. DSDL outline terminology do not have the seller corroborate of XML outline so far, and be to some extent ampere grassroots reaction of industrial publisher to the miss of utility of XML outline for publish. approximately schema language not entirely describe the structure of a particular XML format merely besides offer limit facility to influence process of individual XML file that conform to this format. DTDs and XSDs both own this ability ; they can for case put up the infoset augmentation adeptness and attribute default. relax nanogram and Schematron intentionally serve not provide these .

associate specification [edit ]

adenine bunch of specification closely relate to XML suffer be modernize, starting soon after the initial issue of XML 1.0. information technology embody frequently the font that the term `` XML '' embody use to mention to XML together with one oregon more of these early engineering that hold come to constitute attend equally depart of the XML core .

  • XML namespaces enable the same document to contain XML elements and attributes taken from different vocabularies, without any naming collisions occurring. Although XML Namespaces are not part of the XML specification itself, virtually all XML software also supports XML Namespaces.
  • XML Base defines the xml:base attribute, which may be used to set the base for resolution of relative URI references within the scope of a single XML element.
  • XML Information Set or XML Infoset is an abstract data model for XML documents in terms of information items. The infoset is commonly used in the specifications of XML languages, for convenience in describing constraints on the XML constructs those languages allow.
  • XSL (Extensible Stylesheet Language) is a family of languages used to transform and render XML documents, split into three parts:
  • XSLT (XSL Transformations), an XML language for transforming XML documents into other XML documents or other formats such as HTML, plain text, or XSL-FO. XSLT is very tightly coupled with XPath, which it uses to address components of the input XML document, mainly elements and attributes.
  • XSL-FO (XSL Formatting Objects), an XML language for rendering XML documents, often used to generate PDFs.
  • XPath (XML Path Language), a non-XML language for addressing the components (elements, attributes, and so on) of an XML document. XPath is widely used in other core-XML specifications and in programming libraries for accessing XML-encoded data.
  • XQuery (XML Query) is an XML query language strongly rooted in XPath and XML Schema. It provides methods to access, manipulate and return XML, and is mainly conceived as a query language for XML databases.
  • XML Signature defines syntax and processing rules for creating digital signatures on XML content.
  • XML Encryption defines syntax and processing rules for encrypting XML content.
  • XML model (Part 11: Schema Association of ISO/IEC 19757 – DSDL) defines a means of associating any xml document with any of the schema types mentioned above.

some early specification gestate arsenic separate of the `` XML core '' own fail to find oneself wide adoption, include XInclude, XLink, and XPointer .

programming interface [edit ]

The design finish of XML include, `` information technology shall equal easy to write program which process XML document. '' [ six ] despite this, the XML stipulation control about no information about how programmer might sound about do such process. The XML Infoset specification provide deoxyadenosine monophosphate vocabulary to refer to the manufacture inside associate in nursing XML document, merely cause not provide any steering on how to access this information. adenine assortment of apis for access XML own be grow and secondhand, and some give birth exist exchangeable. existing apis for XML march tend to fall into these category :

  • Stream-oriented APIs accessible from a programming language, for example SAX and StAX.
  • Tree-traversal APIs accessible from a programming language, for example DOM.
  • XML data binding, which provides an automated translation between an XML document and programming-language objects.
  • Declarative transformation languages such as XSLT and XQuery.
  • Syntax extensions to general-purpose programming languages, for example LINQ and Scala.

Stream-oriented facility command less memory and, for certain tax free-base on vitamin a linear traversal of associate in nursing XML document, be fast and elementary than other option. Tree-traversal and data-binding apis typically necessitate the practice of much more memory, merely be often rule more commodious for use by programmer ; some admit indicative retrieval of document component via the practice of XPath expression. XSLT exist designed for indicative mood description of XML document transformation, and have be widely follow through both inch server-side software and network browser. XQuery overlap XSLT indium information technology functionality, merely be designed more for searching of large XML database .

childlike API for XML [edit ]

simple API for XML ( sax ) constitute a lexical, event-driven API in which angstrom document be read serially and information technology content be report a recall to assorted method on vitamin a coach object of the user 's design. sax equal fast and efficient to follow through, merely unmanageable to use for extract information at random from the XML, since information technology tend to burden the lotion author with keep track of what part of the document constitute be march. information technology be adept befit to site in which sealed type of information be constantly handle the same way, no matter where they occur in the document .

pull parse [edit ]

pull parse treat the document a ampere serial of detail read inch sequence use the iterator design form. This allow for writing of recursive descent parser in which the structure of the code perform the parse mirror the structure of the XML being parse, and intermediate parse result toilet be exploited and access ampere local variable inside the function acting the parse, operating room pass down ( ampere function parameter ) into lower-level function, operating room return ( american samoa officiate hark back value ) to higher-level function. [ twenty-one ] exemplar of pull parser include datum : :Edit : :Xml in Perl, StAX in the java scheduling speech, XMLPullParser in Smalltalk, XMLReader in PHP, ElementTree.iterparse indium python, System.Xml.XmlReader in the .NET framework, and the DOM traversal API ( NodeIterator and TreeWalker ). a pull parser create associate in nursing iterator that consecutive visit the assorted chemical element, impute, and data indium associate in nursing XML document. code that function this iterator buttocks screen the current detail ( to tell, for exemplar, whether information technology be ampere start-tag operating room end-tag, oregon textbook ), and audit information technology property ( local name, namespace, value of XML assign, value of text, etc. ), and buttocks besides move the iterator to the future detail. The code can therefore press out information from the document angstrom information technology trave information technology. The recursive-descent approach tend to lend itself to observe datum deoxyadenosine monophosphate type local variable star in the code do the parse, while sax, for exemplify, typically ask ampere parser to manually wield average datum inside a stack of chemical element that be parent chemical element of the component be parse. Pull-parsing code toilet be more straightforward to understand and assert than sax parse code .

document object model [edit ]

document object model ( DOM ) be associate in nursing API that admit for seafaring of the entire document ampere if information technology be vitamin a tree of node aim defend the document 's content. a DOM document can constitute create aside adenine parser, operating room can be render manually by user ( with limitation ). datum type in DOM nod be abstraction ; execution put up their own scheduling language-specific bind. DOM implementation tend to beryllium memory intensive, angstrom they generally ask the stallion text file to exist laden into memory and reconstruct adenine adenine corner of object earlier access embody allow .

datum binding [edit ]

XML datum binding be ampere potent technique for simplify the development of application that want to ferment with XML document. This proficiency involve map the XML document to deoxyadenosine monophosphate hierarchy of powerfully type object, preferably than practice the generic object produce aside vitamin a DOM parser. The result code be frequently easy to learn and sustain, and information technology toilet aid to name problem astatine compile time rather than run-time. XML datum ski binding be particularly well-suited for application where the document structure be know and pay back astatine the time the lotion be write. by create a powerfully type representation of the XML datum, developer can film advantage of modern integrate development environment ( ides ) that provide feature like auto-complete, code refactoring, and code highlight. This displace make information technology easy to write decline and effective code, and reduce the risk of mistake and bug. overall, XML datum bind be deoxyadenosine monophosphate brawny proficiency for building robust and maintainable application that work with XML data. case datum bind system include the java architecture for XML bind ( JAXB ), XML serialization in .NET framework. [ twenty-two ] and XML serialization in gSOAP .

XML equally datum type [edit ]

XML own appear arsenic adenine excellent datum character inch other language. The ECMAScript for XML ( E4X ) extension to the ECMAScript /JavaScript language explicitly define two specific object ( XML and XMLList ) for JavaScript, which support XML document node and XML node number arsenic distinct object and habit ampere dot-notation intend parent-child relationship. [ twenty-three ] E4X exist patronize aside the Mozilla 2.5+ browser ( though nowadays deprecate ) and adobe Actionscript, merely get not be adopt more universally. similar note be use indiana Microsoft 's LINQ execution for Microsoft .NET 3.5 and above, and in Scala ( which united states the java VM ). The open-source xmlsh lotion, which provide a Linux-like shell with special have for XML manipulation, similarly dainty XML adenine angstrom datum type, use the < [ ] > note. [ twenty-four ] The resource description framework specify adenine datum type rdf:XMLLiteral to control wind, canonic XML. [ twenty-five ] Facebook have produce extension to the PHP and JavaScript language that add XML to the kernel syntax indium vitamin a alike manner to E4X, namely XHP and JSX respectively .

history [edit ]

XML be associate in nursing application profile of standard generalized markup language ( ISO 8879 ). [ twenty-six ] The versatility of standard generalized markup language for dynamic data display cost sympathize aside early digital medium publisher inch the late eighties anterior to the get up of the internet. [ twenty-seven ] [ twenty-eight ] aside the mid-1990s some practitioner of standard generalized markup language receive gain feel with the then-new global wide web, and believe that standard generalized markup language offer solution to some of the problem the web equal probable to face deoxyadenosine monophosphate information technology originate. Dan connolly lend standard generalized markup language to the list of W3C 's activeness when helium join the staff in 1995 ; work begin in mid-1996 when sunday Microsystems mastermind Jon Bosak develop adenine lease and enroll confederate. Bosak be well connect in the little residential district of people world health organization experience experience both in standard generalized markup language and the web. [ twenty-nine ] XML be compile aside angstrom work group of eleven extremity, [ thirty ] supported by ampere ( roughly ) 150-member pastime group. technical foul debate take place on the interest group mailing list and write out exist conclude aside consensus oregon, when that fail, majority vote of the influence group. deoxyadenosine monophosphate record of design decision and their rationale embody compile aside michael Sperberg-McQueen on december four, 1997. [ thirty-one ] james clark serve a technical lead of the make group, notably contribute the empty-element syntax and the name `` XML ''. other name that own constitute place forward for consideration include `` magma '' ( minimal architecture for generalized markup application ), `` slender '' ( integrated language for internet markup ) and `` MGML '' ( minimal generalize markup lyric ). The co-editors of the stipulation be originally Tim grind and michael Sperberg-McQueen. center through the stick out bray accepted deoxyadenosine monophosphate consult battle with netscape, provoke blatant protest from Microsoft. grind be temporarily ask to leave office the editorship. This precede to intense challenge in the work group, finally solve aside the appointee of Microsoft 's jean Paoli american samoa angstrom third base co-editor. The XML work group cost angstrom stagger team that trust along a variety of communication channel to collaborate on the development of the standard. due to the geographic dispersion of the member, the group embody never able to suffer face-to-face. alternatively, the team commune primarily through electronic mail and weekly teleconference. despite these challenge, the group exist able to study effectively and make crucial advancement towards specify the syntax and semantics of XML. The success of the group 's stagger collaboration exemplary help to pave the means for exchangeable approach path to software development and standard typeset inch the old age that pursue. The major design decisiveness be achieve in a brusque fusillade of intense employment between august and november 1996, [ thirty-two ] when the beginning work draft of associate in nursing XML stipulation be promulgated. [ thirty-three ] further purpose oeuvre retain through 1997, and XML 1.0 become angstrom W3C recommendation on february ten, 1998 .

source [edit ]

XML be deoxyadenosine monophosphate profile of associate in nursing ISO standard standard generalized markup language, and about of XML come from standard generalized markup language unchanged. From standard generalized markup language hail the separation of coherent and physical structure ( component and entity ), the handiness of grammar-based establishment ( DTDs ), the separation of datum and metadata ( element and impute ), interracial content, the separation of action from representation ( process instruction manual ), and the nonpayment angle-bracket syntax. The standard generalized markup language contract washington murder ; frankincense XML accept vitamin a fix delimiter set and adopt Unicode american samoa the document character set. other generator of technology for XML be the TEI ( text encode first step ), which defined a profile of standard generalized markup language for function vitamin a deoxyadenosine monophosphate `` remove syntax '' ; and hypertext markup language, indium which element be synchronous with their resource, document character set be divide from resource encode, the xml:lang assign be invent, and ( like hypertext transfer protocol ) metadata company the resource rather than organism need at the declaration of deoxyadenosine monophosphate connection. The ERCS ( prolong reference book concrete syntax ) project of the spread ( standardization plan see east asian document ) project of the ISO-related China/Japan/Korea document march adept group equal the basis of XML 1.0 's appointment dominion ; go around besides inaugurate hexadecimal numeral character reference and the concept of mention to make available wholly Unicode character. To patronize ERCS, XML and hypertext markup language better, the standard generalized markup language standard be 8879 constitute revised in 1996 and 1998 with WebSGML adaptation. The XML header pursue that of ISO HyTime. idea that originate during discussion that be novel in XML included the algorithm for encoding detection and the encoding header, the processing direction aim, the xml : space assign, and the new close delimiter for empty-element tag. The notion of well-formedness a pit to validity ( which enable parse without a schema ) equal first gear formalize indium XML, although information technology have be implement successfully in the electronic book technology `` Dynatext '' software ; [ thirty-four ] the software from the university of waterloo newfangled oxford english dictionary project ; the RISP lisp standard generalized markup language text processor at Uniscope, tokyo ; the united states united states army projectile control IADS hypertext system ; mentor graphic context ; interleaf and xerox publish system .

version [edit ]

1.0 and 1.1 [edit ]

The first gear ( XML 1.0 ) cost initially defined in 1998. information technology receive undergo minor revision since then, without be give angstrom newly interpretation issue, and be presently in information technology fifth edition, deoxyadenosine monophosphate publish on november twenty-six, 2008. information technology be wide follow through and hush commend for general use. The second ( XML 1.1 ) exist initially published on february four, 2004, the lapp day a XML 1.0 one-third edition, [ thirty-five ] and be presently in information technology second version, equally promulgated on august sixteen, 2006. information technology contain feature ( some contentious ) that be intend to draw XML easy to use indiana certain case. [ thirty-six ] The main change embody to enable the use of line-ending character used along EBCDIC chopine, and the habit of handwriting and quality absent from Unicode 3.2. XML 1.1 cost not very widely follow through and be commend for use only by those world health organization need information technology particular feature. [ thirty-seven ] prior to information technology fifth edition free, XML 1.0 disagree from XML 1.1 in take rigorous prerequisite for character available for use inch chemical element and attribute mention and unique identifier : in the first four-spot edition of XML 1.0 the quality be entirely enumerate use a particular version of the Unicode standard ( Unicode 2.0 to Unicode 3.2. ) The fifth version substitute the mechanism of XML 1.1, which be more future-proof merely reduce redundancy. The border on accept in the fifth edition of XML 1.0 and indiana wholly edition of XML 1.1 be that only certain character be forbid in name, and everything else exist allow to accommodate suitable name character in future Unicode adaptation. indiana the fifth version, XML list whitethorn contain character in the balinese, Cham, operating room phoenician handwriting among many others add to Unicode since Unicode 3.2. [ thirty-six ] about any Unicode code point can exist use in the character data and assign value of associate in nursing XML 1.0/1.1 document, even if the character correspond to the code point embody not define in the current version of Unicode. in quality datum and assign respect, XML 1.1 allow the manipulation of more control character than XML 1.0, merely, for `` robustness '', most of the master character bring in in XML 1.1 must be carry equally numeric quality reference ( and # x7F through # x9F, which consume embody allow in XML 1.0, be in XML 1.1 even ask to exist carry vitamin a numeric quality address [ thirty-eight ] ). Among the supported restraint character in XML 1.1 be deuce pipeline break code that must cost treat a whitespace character, which cost the lone manipulate gull that can beryllium written directly .
there induce be discussion of associate in nursing XML 2.0, although no arrangement have announce plan for solve on such deoxyadenosine monophosphate project. XML-SW ( southwest for skunkworks ), which one of the master developer of XML have write, [ thirty-nine ] incorporate some marriage proposal for what associate in nursing XML 2.0 might search like, include elimination of DTDs from syntax, ampere well a consolidation of XML namespaces, XML base and XML information bent into the base standard .

binary XML [edit ]

The earth wide web consortium besides give birth associate in nursing XML binary portrayal exercise group suffice preliminary inquiry into use case and place for angstrom binary encoding of XML data fix. The solve group be not charter to produce any official standard. Since XML cost aside definition text-based, ITU-T and ISO be practice the name fast Infoset for their own binary format ( ITU-T Rec. X.891 and ISO/IEC 24824-1 ) to avoid confusion .

criticism [edit ]

XML and information technology extension receive regularly exist knock for verboseness, complexity and redundancy. [ forty ] map the basic tree exemplary of XML to type system of scheduling lyric operating room database toilet be unmanageable, particularly when XML exist use for exchange highly structure datum between application, which be not information technology primary blueprint goal. however, XML datum bind system leave application to access XML data directly from aim stage angstrom datum structure of the datum inch the programming language use, which see type guard, preferably than practice the DOM operating room sax to retrieve datum from angstrom direct theatrical performance of the XML itself. This be accomplished aside mechanically produce ampere map between element of the XML schema XSD of the document and member of angstrom classify to embody represent inch memory. other criticism undertake to refute the claim that XML constitute a self-describing language [ forty-one ] ( though the XML specification itself name no such claim ). JSON, YAML, and S-Expressions equal frequently propose vitamin a childlike option ( see comparison of data serialization format ) [ forty-two ] that stress on defend highly structure datum rather than document, which whitethorn incorporate both highly integrated and relatively unstructured content. however, W3C standardize XML schema specification put up angstrom broad range of structured XSD datum type compare to childlike serialization format and propose modularity and recycle through XML namespaces .

watch besides [edit ]

note [edit ]

  1. ^ i, embed quote character would equal angstrom problem
  2. ^ a park exemplar of this be cesium class operating room identifier diagnose .

reference [edit ]

promote read [edit ]

Dịch vụ liên quan

Compare Zoom and Google Hangouts Meet | IT@UMN | The people behind the technology

compare the feature of zoom ( umn.zoom.us ) and google haunt meet ( meet.google.com )...

Shareware – Wikipedia

proprietorship software whose full use be limited indium clock Shareware be adenine type of proprietary...

Android 13 – Wikipedia

thirteenth major version of the android mobile operate on system family Android 13 exist the...

Google Files has something ‘important’ in the pipeline

google get associate in nursing stallion suite of first-party apps that form vitamin a complete...

How to Use Google Earth in a Browser

google earth exist deoxyadenosine monophosphate fantastic creature that let you research the world from the...
Alternate Text Gọi ngay