Media types

An Internet media type is, generally speaking, a property of a data set, describing both the general type of data (such as "text" or "image" or "application"; the last one refers to program-specific internal data formats) and, as a subtype, a specific format for the data. The concept was originally defined as "MIME content types".

Media types relate to HTML as follows:

The HTML 3.2 Reference Specification refers to RFC 1521 but that specification was superseded by RFC 2046 (in November 1996). The procedure for registering types in given in RFC 2048; according to it, the registry is kept at ftp://ftp.isi.edu/in-notes/iana/assignments/media-types/

For less authoritative but more readably presented information, see document MIME Types by Chris Herborth.

In addition to standardized media types, there are media types which are in fact supported by popular servers and browsers. Appendix B of Special Edition Using CGI lists many of them.

You can check what is the media type information sent by a server as follows: Assuming we are interested in the media type of the document at URL http://host/path, establish a Telnet connection to host using the port number in the URL if present, port 80 otherwise. Then give the command
HEAD /path HTTP/1.0
and then an empty line. Example (where the Telnet connection is established by starting the telnet program from Unix command level):

beta ~ 51 % telnet www.hut.fi 80
Trying 130.233.224.28...
Connected to info-e.hut.fi.
Escape character is '^]'.
HEAD /home/jkorpela/perhe.jpg HTTP/1.0
HTTP/1.1 200 OK
Date: Tue, 23 Sep 1997 12:37:05 GMT
Server: Apache/1.2.4
Last-Modified: Tue, 08 Aug 1995 08:29:53 GMT
ETag: "16391-9232-30272081"
Content-Length: 37426
Accept-Ranges: bytes
Connection: close
Content-Type: image/jpeg
Connection closed by foreign host.
beta ~ 52 % exit

Here the Content-Type: field tells that the media type is image/jpeg.