新博客:
完整版 - AVFoundation Programming Guide分章节版:
— 第1章:About AVFoundation - AVFoundation概述
— 第2章:Using Assets - 使用Assets
— 第3章:Playback - 播放
— 第4章:Editing - 编辑
— 第5章:Still and Video Media Capture - 静态视频媒体捕获
— 第6章:Export - 输出
— 第7章:Time and Media Representations 时间和媒体表现CSDN博客:
完整版 - AVFoundation Programming Guide分章节版:
— 第1章:About AVFoundation - AVFoundation概述
— 第2章:Using Assets - 使用Assets
— 第3章:Playback - 播放
— 第4章:Editing - 编辑
— 第5章:Still and Video Media Capture - 静态视频媒体捕获
— 第6章:Export - 输出
— 第7章:Time and Media Representations 时间和媒体表现版权声明:本文为博主原创翻译,如需转载请注明出处。
苹果源文档地址 - 点击这里
Export - 输出
To read and write audiovisual assets, you must use the export APIs provided by the AVFoundation framework. The AVAssetExportSession class provides an interface for simple exporting needs, such as modifying the file format or trimming the length of an asset (see Trimming and Transcoding a Movie). For more in-depth exporting needs, use the AVAssetReader and AVAssetWriter classes.
必须使用 AVFoundation
框架提供的导出 APIs
去读写音视频资产。AVAssetExportSession 类为简单输出需要,提供了一个接口,例如修改文件格式或者削减资产的长度(见 Trimming and Transcoding a Movie)。为了更深入的导出需求,使用 AVAssetReader 和 AVAssetWriter 类。
Use an AVAssetReader when you want to perform an operation on the contents of an asset. For example, you might read the audio track of an asset to produce a visual representation of the waveform. To produce an asset from media such as sample buffers or still images, use an AVAssetWriter object.
当你想对一项资产的内容进行操作时,使用 AVAssetReader
。例如,可以读取一个资产的音频轨道,以产生波形的可视化表示。为了从媒体(比如样品缓冲或者静态图像)生成资产,使用 AVAssetWriter
对象。
Note: The asset reader and writer classes are not intended to be used for real-time processing. In fact, an asset reader cannot even be used for reading from a real-time source like an HTTP live stream. However, if you are using an asset writer with a real-time data source, such as an AVCaptureOutput object, set the expectsMediaDataInRealTime property of your asset writer’s inputs to YES. Setting this property to YES for a non-real-time data source will result in your files not being interleaved properly.
注意:资产
reader
和writer
类不打算用到实时处理。实际上,一个资产读取器甚至不能用于从一个类似HTTP
直播流的实时资源中读取。然而,如果你使用带着实时数据资源的资产写入器,比如 AVCaptureOutput 对象,设置资产写入器入口的 expectsMediaDataInRealTime 属性为YES
。将此属性设置为YES
的非实时数据源将导致你的文件不能被正确的扫描。
Reading an Asset - 读取资产
Each AVAssetReader object can be associated only with a single asset at a time, but this asset may contain multiple tracks. For this reason, you must assign concrete subclasses of the AVAssetReaderOutput class to your asset reader before you begin reading in order to configure how the media data is read. There are three concrete subclasses of the AVAssetReaderOutput base class that you can use for your asset reading needs: AVAssetReaderTrackOutput, AVAssetReaderAudioMixOutput, and AVAssetReaderVideoCompositionOutput.
每个 AVAssetReader
对象只能与单个资产有关,但这个资产可能包含多个轨道。为此,你必须指定 AVAssetReaderOutput 类的具体子类给你的资产读取器,在你开始按顺序访问你的资产以配置如何读取数据之前。有 AVAssetReaderOutput
基类的3个具体子类,可以使用你的资产访问需求 AVAssetReaderTrackOutput,AVAssetReaderAudioMixOutput,AVAssetReaderVideoCompositionOutput。
Creating the Asset Reader - 创建资产读取器
All you need to initialize an AVAssetReader object is the asset that you want to read.
所有你需要去初始化 AVAssetReader
对象是你想要访问的资产。
1 | NSError *outError; |
Note: Always check that the asset reader returned to you is non-nil to ensure that the asset reader was initialized successfully. Otherwise, the error parameter (outError in the previous example) will contain the relevant error information.
注意:总是要资产读取器是否返回给你的时
non-nil
,以确保资产读取器已经成功被初始化。否则,错误参数(之前的例子中outError
)将会包含有关错误的信息。
Setting Up the Asset Reader Outputs - 建立资产读取器出口
After you have created your asset reader, set up at least one output to receive the media data being read. When setting up your outputs, be sure to set the alwaysCopiesSampleData property to NO. In this way, you reap the benefits of performance improvements. In all of the examples within this chapter, this property could and should be set to NO.
在你创建了资产读取器之后,至少设置一个出口以接收正在读取的媒体数据。当建立你的出口,确保设置 alwaysCopiesSampleData 属性为 NO
。这样,你就收获了性能改进的好处。这一章的所有例子中,这个属性可以并且应该被设置为 NO
。
If you want only to read media data from one or more tracks and potentially convert that data to a different format, use the AVAssetReaderTrackOutput class, using a single track output object for each AVAssetTrack object that you want to read from your asset. To decompress an audio track to Linear PCM with an asset reader, you set up your track output as follows:
如果你只想从一个或多个轨道读取媒体数据,潜在的数据转换为不同的格式,使用 AVAssetReaderTrackOutput
类,每个你想从你的资产中读取 AVAssetTrack 对象都使用单轨道出口对象。将音频轨道解压缩为有资产读取器的 Linear PCM
,建立轨道出口如下:
1 | AVAsset *localAsset = assetReader.asset; |
Note: To read the media data from a specific asset track in the format in which it was stored, pass nil to the outputSettings parameter.
注意:从一个特定的资产轨道读取媒体数据,以它被存储的格式,传
nil
给outputSettings
参数。
You use the AVAssetReaderAudioMixOutput and AVAssetReaderVideoCompositionOutput classes to read media data that has been mixed or composited together using an AVAudioMix object or AVVideoComposition object, respectively. Typically, these outputs are used when your asset reader is reading from an AVComposition object.
使用 AVAssetReaderAudioMixOutput
和 AVAssetReaderVideoCompositionOutput
类来读取媒体数据,这些媒体数据是分别使用 AVAudioMix 对象或者 AVVideoComposition 对象混合或者组合在一起。通常情况下,当你的资产读取器正在从 AVComposition 读取时,才使用这些出口。
With a single audio mix output, you can read multiple audio tracks from your asset that have been mixed together using an AVAudioMix object. To specify how the audio tracks are mixed, assign the mix to the AVAssetReaderAudioMixOutput object after initialization. The following code displays how to create an audio mix output with all of the audio tracks from your asset, decompress the audio tracks to Linear PCM, and assign an audio mix object to the output. For details on how to configure an audio mix, see Editing.
一个单一音频混合出口,可以从 已经使用 AVAudioMix
对象混合在一起的资产中读取多个音轨。指定音轨是如何被混合在一起的,将混合后的 AVAssetReaderAudioMixOutput
对象初始化。下面的代码显示了如何从资产中创建一个带着所有音轨的音频混合出口,将音轨解压为 Linear PCM
,并指定音频混合对象到出口。有如何配置音频混合的细节,请参见 Editing 。
1 | AVAudioMix *audioMix = < |
Note: Passing nil for the audioSettings parameter tells the asset reader to return samples in a convenient uncompressed format. The same is true for the AVAssetReaderVideoCompositionOutput class.
注意:给
audioSettings
参数传递nil
,告诉资产读取器返回一个方便的未压缩格式的样本。对于AVAssetReaderVideoCompositionOutput
类同样是可以的。
The video composition output behaves in much the same way: You can read multiple video tracks from your asset that have been composited together using an AVVideoComposition object. To read the media data from multiple composited video tracks and decompress it to ARGB, set up your output as follows:
视频合成输出行为有许多同样的方式:可以从资产(已经被使用 AVVideoComposition
对象合并在一起)读取多个视频轨道。从多个复合视频轨道读取媒体数据,解压缩为 ARGB
,建立出口如下:
1 | AVVideoComposition *videoComposition = < |
Reading the Asset’s Media Data - 读取资产媒体数据
To start reading after setting up all of the outputs you need, call the startReading method on your asset reader. Next, retrieve the media data individually from each output using the copyNextSampleBuffer method. To start up an asset reader with a single output and read all of its media samples, do the following:
开始读取后建立所有你需要的出口,在你的资产读取器中调用 startReading 方法。下一步,使用 copyNextSampleBuffer 方法从每个出口分别获取媒体数据。以一个出口启动一个资产读取器,并读取它的所有媒体样本,跟着下面做:
1 | // Start the asset reader up. |
Writing an Asset - 写入资产
The AVAssetWriter class to write media data from multiple sources to a single file of a specified file format. You don’t need to associate your asset writer object with a specific asset, but you must use a separate asset writer for each output file that you want to create. Because an asset writer can write media data from multiple sources, you must create an AVAssetWriterInput object for each individual track that you want to write to the output file. Each AVAssetWriterInput object expects to receive data in the form of CMSampleBufferRef objects, but if you want to append CVPixelBufferRef objects to your asset writer input, use the AVAssetWriterInputPixelBufferAdaptor class.
AVAssetWriter 类从多个源将媒体数据写入到指定文件格式的单个文件中。不需要将你的资产写入器与一个特定的资产联系起来,但你必须为你要创建的每个输出文件 使用一个独立的资产写入器。因为一个资产写入器可以从多个来源写入媒体数据,你必须为你想写入输出文件的每个独立的轨道创建一个 AVAssetWriterInput 对象。每个 AVAssetWriterInput
对象预计以 CMSampleBufferRef 对象的形成接收数据,但如果你想给你的资产写入器入口 附加 CVPixelBufferRef 对象,使用 AVAssetWriterInputPixelBufferAdaptor 类。
Creating the Asset Writer - 创建资产写入器
To create an asset writer, specify the URL for the output file and the desired file type. The following code displays how to initialize an asset writer to create a QuickTime movie:
为了创建一个资产写入器,为出口文件指定 URL
和所需的文件类型。下面的代码显示了如何初始化一个资产写入器来创建一个 QuickTime
影片:
1 | NSError *outError; |
Setting Up the Asset Writer Inputs - 建立资产写入器入口
For your asset writer to be able to write media data, you must set up at least one asset writer input. For example, if your source of media data is already vending media samples as CMSampleBufferRef objects, just use the AVAssetWriterInput class. To set up an asset writer input that compresses audio media data to 128 kbps AAC and connect it to your asset writer, do the following:
为你的资产写入器能够写入媒体数据,必须至少设置一个资产写入器入口。例如,如果你的媒体数据源已经以 CMSampleBufferRef
对象声明了声明了媒体样本,只使用 AVAssetWriterInput
类。建立一个资产写入器入口,将音频媒体数据压缩到 128 kbps AAC
并且将它与你的资产写入器连接,跟着下面做:
1 | // Configure the channel layout as stereo. |
Note: If you want the media data to be written in the format in which it was stored, pass nil in the outputSettings parameter. Pass nil only if the asset writer was initialized with a fileType of AVFileTypeQuickTimeMovie.
注意:如果你想让媒体数据以它被存储的格式写入,给
outputSettings
参数传nil
。只有资产写入器曾用 AVFileTypeQuickTimeMovie 的fileType
初始化,才传nil
。
Your asset writer input can optionally include some metadata or specify a different transform for a particular track using the metadata and transform properties respectively. For an asset writer input whose data source is a video track, you can maintain the video’s original transform in the output file by doing the following:
你的资产写入器入口可以选择性的包含一些元数据 或者 分别使用 metadata 和 transform 属性为特定的轨道指定不同的变换。对于一个资产写入器的入口,其数据源是一个视频轨道,可以通过下面示例来在输出文件中维持视频的原始变换:
1 | AVAsset *videoAsset = < |
Note: Set the metadata and transform properties before you begin writing with your asset writer for them to take effect.
注意:在开始用资产写入器写入生效之前,先设置
metadata
和transform
属性。
When writing media data to the output file, sometimes you may want to allocate pixel buffers. To do so, use the AVAssetWriterInputPixelBufferAdaptor class. For greatest efficiency, instead of adding pixel buffers that were allocated using a separate pool, use the pixel buffer pool provided by the pixel buffer adaptor. The following code creates a pixel buffer object working in the RGB domain that will use CGImage objects to create its pixel buffers.
当将媒体数据写入输出文件时,有时你可能要分配像素缓冲区。这样做:使用 AVAssetWriterInputPixelBufferAdaptor
类。为了最大的效率,使用由像素缓冲适配器提供的像素缓冲池,代替添加被分配使用一个单独池的像素缓冲区。下面的代码创建一个像素缓冲区对象,在 RGB
色彩下工作,将使用 CGImage 对象创建它的像素缓冲。
1 | NSDictionary *pixelBufferAttributes = @{ |
Note: All AVAssetWriterInputPixelBufferAdaptor objects must be connected to a single asset writer input. That asset writer input must accept media data of type AVMediaTypeVideo.
注:所有的
AVAssetWriterInputPixelBufferAdaptor
对象必须连接到一个单独的资产写入器入口。资产写入器入口必须接受 AVMediaTypeVideo 类型的媒体数据。
Writing Media Data - 写入媒体数据
When you have configured all of the inputs needed for your asset writer, you are ready to begin writing media data. As you did with the asset reader, initiate the writing process with a call to the startWriting method. You then need to start a sample-writing session with a call to the startSessionAtSourceTime: method. All writing done by an asset writer has to occur within one of these sessions and the time range of each session defines the time range of media data included from within the source. For example, if your source is an asset reader that is supplying media data read from an AVAsset object and you don’t want to include media data from the first half of the asset, you would do the following:
当你已经为资产写入器配置所有需要的入口时,这时已经准备好开始写入媒体数据。正如在资产读取器所做的,调用 startWriting 方法发起写入过程。然后你需要启动一个样本 — 调用 startSessionAtSourceTime: 方法的写入会话。资产写入器的所有写入都必须在这些会话中发生,并且每个会话的时间范围 定义 包含在来源内媒体数据的时间范围。例如,如果你的来源是一个资产读取器(它从 AVAsset 对象读取到供应的媒体数据),并且你不想包含来自资产的前半部分的媒体数据,你可以像下面这样做:
1 | CMTime halfAssetDuration = CMTimeMultiplyByFloat64(self.asset.duration, 0.5); |
Normally, to end a writing session you must call the endSessionAtSourceTime: method. However, if your writing session goes right up to the end of your file, you can end the writing session simply by calling the finishWriting method. To start up an asset writer with a single input and write all of its media data, do the following:
通常,必须调用 endSessionAtSourceTime: 方法结束写入会话。然而,如果你的写入会话正确走到了你的文件末尾,可以简单地通过调用 finishWriting 方法来结束写入会话。要启动一个有单一入口的资产写入器并且写入所有媒体数据。下面示例:
1 | // Prepare the asset writer for writing. |
The copyNextSampleBufferToWrite method in the code above is simply a stub. The location of this stub is where you would need to insert some logic to return CMSampleBufferRef objects representing the media data that you want to write. One possible source of sample buffers is an asset reader output.
上述代码中的 copyNextSampleBufferToWrite
方法仅仅是一个 stub
。这个 stub
的位置就是你需要插入一些逻辑 去返回 CMSampleBufferRef
对象 表示你想要写入的媒体数据。示例缓冲区的可能来源是一个资产读取器出口。
Reencoding Assets - 重新编码资产
You can use an asset reader and asset writer object in tandem to convert an asset from one representation to another. Using these objects, you have more control over the conversion than you do with an AVAssetExportSession object. For example, you can choose which of the tracks you want to be represented in the output file, specify your own output format, or modify the asset during the conversion process. The first step in this process is just to set up your asset reader outputs and asset writer inputs as desired. After your asset reader and writer are fully configured, you start up both of them with calls to the startReading and startWriting methods, respectively. The following code snippet displays how to use a single asset writer input to write media data supplied by a single asset reader output:
可以使用资产读取器和资产写入器对象,以一个表现转换到另一个表现的资产。使用这些对象,你必须比用 AVAssetExportSession
对象有更多的控制转换。例如,你可以选择输出文件中想要显示的轨道,指定你自己的输出格式,或者在转换过程中修改该资产。这个过程中第一步是按需建立你的资产读取器出口和资产写入器入口。资产读取器和写入器充分配置后,分别调用 startReading
和 startWriting
方法启动它们。下面的代码片段显示了如何使用一个单一的资产写入器入口去写入 由一个单一的资产读取器出口提供的媒体数据:
1 | NSString *serializationQueueDescription = [NSString stringWithFormat:@"%@ serialization queue", self]; |
Putting It All Together: Using an Asset Reader and Writer in Tandem to Reencode an Asset - 总结:使用资产读取器和写入器串联重新编码资产
This brief code example illustrates how to use an asset reader and writer to reencode the first video and audio track of an asset into a new file. It shows how to:
- Use serialization queues to handle the asynchronous nature of reading and writing audiovisual data
- Initialize an asset reader and configure two asset reader outputs, one for audio and one for video
- Initialize an asset writer and configure two asset writer inputs, one for audio and one for video
- Use an asset reader to asynchronously supply media data to an asset writer through two different - output/input combinations
- Use a dispatch group to be notified of completion of the reencoding process
- Allow a user to cancel the reencoding process once it has begun
这个剪短的代码示例说明如何使用资产读取器和写入器将一个资产的第一个视频和音频轨道重新编码 到一个新文件。它展示了:
- 使用序列化队列来处理读写视听数据的异步性
- 初始化一个资产读取器,并配置两个资产读取器出口,一个用于音频,一个用于视频
- 初始化一个资产写入器,并配置两个资产写入器入口,一个用于音频,一个用于视频
- 使用一个资产读取器,通过两个不同的 输出/输入组合来异步向资产写入器提供媒体数据
- 使用一个调度组接收重新编码过程的完成的通知
- 一旦开始,允许用户取消重新编码过程
Note: To focus on the most relevant code, this example omits several aspects of a complete application. To use AVFoundation, you are expected to have enough experience with Cocoa to be able to infer the missing pieces.
注:关注最相关的代码,这个例子中省略了一个完成应用程序的几个方面。为了使用
AVFoundation
,希望你有足够的Cocoa
经验,能够推断缺少的代码。
Handling the Initial Setup - 处理初始设置
Before you create your asset reader and writer and configure their outputs and inputs, you need to handle some initial setup. The first part of this setup involves creating three separate serialization queues to coordinate the reading and writing process.
在创建资产读取器和写入器和配置它们的出口和入口之前,你需要处理一下初始设置。此设置的第一部分包括创建3个独立的序列化队列来协调读写过程。
1 | NSString *serializationQueueDescription = [NSString stringWithFormat:@"%@ serialization queue", self]; |
The main serialization queue is used to coordinate the starting and stopping of the asset reader and writer (perhaps due to cancellation) and the other two serialization queues are used to serialize the reading and writing by each output/input combination with a potential cancellation.
主序列队列用于协调资产读取器和写入器(可能是由于注销)的启动和停止,其他两个序列队列用于序列化读取器和写入器,通过每一个有潜在注销的输入/输出组合。
Now that you have some serialization queues, load the tracks of your asset and begin the reencoding process.
现在你有一些序列化队列,加载你的资产轨道,并开始重新编码过程。
1 | self.asset = < |
When the track loading process finishes, whether successfully or not, the rest of the work is dispatched to the main serialization queue to ensure that all of this work is serialized with a potential cancellation. Now all that’s left is to implement the cancellation process and the three custom methods at the end of the previous code listing.
当轨道加载过程结束后,无论成功与否,剩下的工作就是被分配到主序列队列以确保所有的工作都是有潜在注销的序列化。现在,剩下就是实现注销进程和前面的代码清单的结尾处的3个自定义方法。
Initializing the Asset Reader and Writer - 初始化资产读取器和写入器
The custom setupAssetReaderAndAssetWriter: method initializes the reader and writer and configures two output/input combinations, one for an audio track and one for a video track. In this example, the audio is decompressed to Linear PCM using the asset reader and compressed back to 128 kbps AAC using the asset writer. The video is decompressed to YUV using the asset reader and compressed to H.264 using the asset writer.
自定义 setupAssetReaderAndAssetWriter:
方法初始化读取器和写入器,并且配置两个输入/输出组合,一个用于音频轨道,一个用于视频轨道。在这个例子中,使用资产读取器音频被解压缩到 Linear PCM
,使用资产写入器压缩回 128 kbps AAC
。使用资产读取器将视频解压缩到 YUV
,使用资产写入器压缩为 H.264
。
1 |
|
Reencoding the Asset - 重新编码资产
Provided that the asset reader and writer are successfully initialized and configured, the startAssetReaderAndWriter: method described in Handling the Initial Setup is called. This method is where the actual reading and writing of the asset takes place.
如果资产读取器和写入器成功地初始化和配置,在 Handling the Initial Setup 中发现调用 startAssetReaderAndWriter:
方法。这个方法实际上是资产读写发生的地方。
1 | - (BOOL)startAssetReaderAndWriter:(NSError **)outError |
During reencoding, the audio and video tracks are asynchronously handled on individual serialization queues to increase the overall performance of the process, but both queues are contained within the same dispatch group. By placing the work for each track within the same dispatch group, the group can send a notification when all of the work is done and the success of the reencoding process can be determined.
重新编码期间,音频和视频轨道是在各自的串行队形上异步处理,来增加进程的整体性能,但两个队列包含在同一调度组中。为同一调度组内的每个轨道安排工作,当所有的工作完成,并能够确定重新编码过程的成功,该组可以发送一个通知。
Handling Completion - 处理完成
To handle the completion of the reading and writing process, the readingAndWritingDidFinishSuccessfully: method is called—with parameters indicating whether or not the reencoding completed successfully. If the process didn’t finish successfully, the asset reader and writer are both canceled and any UI related tasks are dispatched to the main queue.
处理读写进程的完成,readingAndWritingDidFinishSuccessfully:
方法被调用,带着参数,指出重新编码是否成功完成。如果进程没有成功完成,该资产读取器和写入器都被取消,任何 UI
相关的任何都被发送到主队列中。
1 | - (void)readingAndWritingDidFinishSuccessfully:(BOOL)success withError:(NSError *)error |
Handling Cancellation - 处理注销
Using multiple serialization queues, you can allow the user of your app to cancel the reencoding process with ease. On the main serialization queue, messages are asynchronously sent to each of the asset reencoding serialization queues to cancel their reading and writing. When these two serialization queues complete their cancellation, the dispatch group sends a notification to the main serialization queue where the cancelled property is set to YES. You might associate the cancel method from the following code listing with a button on your UI.
使用多个序列化队列,你可以提供方便,让你的应用程序的用户取消重新编码进程。在主串行队列,消息被异步发送到每个资产重编码序列化队列,来取消它们的读写。当这两个序列化队列完成它们的注销,调度组向主序列化队列(cancelled
属性被设置为 YES
)发送一个通知.你可能从下面的代码将 cancel
方法与 UI
上的按钮关联起来。
1 | - (void)cancel |
Asset Output Settings Assistant - 资产出口设置助手
The AVOutputSettingsAssistant class aids in creating output-settings dictionaries for an asset reader or writer. This makes setup much simpler, especially for high frame rate H264 movies that have a number of specific presets. Listing 5-1 shows an example that uses the output settings assistant to use the settings assistant.
AVOutputSettingsAssistant 类在创建出口时能帮上忙 — 为资产读取器或者写入器设置字典。这使得设置更简单,特别是对于有一些具体的预设的高帧速率 H264
影片。 Listing 5-1
显示了使用输出设置助手去使用设置助手的例子。
Listing 5-1 AVOutputSettingsAssistant sample
1 | AVOutputSettingsAssistant *outputSettingsAssistant = [AVOutputSettingsAssistant outputSettingsAssistantWithPreset:<some preset>]; |