I try to grok that: Apple is talking about “packets” in audio files, and there is a fancy function called AudioFileReadPackets which takes a lot of arguments. One of them specifies the “start packet”, and another one the number of packets which you want to read.
So I imagine an audio file to look like this, internally: It’s made up of a lot of packets. If it’s an audio file which has an variable bit rate format, then every packet may have a different size. If the file has an constant bit rate format, then every packet is the same size. So an audio file is like a truck full of boxes, and every box contains some interesting stuff.
Is that correct? Does it apply to any kind of file? Is this how files actually look like?
The question (even with the “especially audio files” qualification) is far too broad; different file formats are, well, different!
So to answer the question you will first have to specify a particular file type; then the answer to the question will invariably to look at its specification. Proprietary formats may not have a publicly available specification.
Specifications for many files (official and reverse engineered) can be found at the brilliant Wotsit’s Format site.
AAC used by Apple iTunes and others is defined by ISO/IEC 13818-7:2006. The document will cost you 252 Swiss Francs (about US$233)! You’d have to be really interested (commercially) to pay that rather than use an existing AAC Codec.
“Packet” is a term commonly used in data transmission, so may be more applicable to audio streaming than audio files, where a “frame” may be more appropriate, or for data files in general a “record”, but the terminology is flexible because it means whatever the person that wrote it thought it meant! If enough people misuse a term, it essentially becomes redefined (or multiply defined) to mean that, so I would not get too hung up on that. The author was do doubt using it to define a unit that has a defined format within a file that has multiple such units repeated sequentially.