iOS中摄像头录像并输出x265视频

2021-09-01

今天尝试了一下使用开源的X265 Library在iOS设备上进行编码，实际结果不尽人意，效率十分渣，当然这有很大一部分因为是本人并没有进行任何效率上的优化，但是按照本人手上的iPad Air2的转码效率来看，感觉想做到实时进行高清晰度的x265软编码直播之类的感觉很难实现，不过辛苦了一天还是把过程mark下来吧。
下载x265 library源码下载链接
编译出iOS能使用的x265 library 在网上找了下，基本上是木有iOS的编译方法，不过把x265源码下载下来之后，在源码的build目录下可以找到一个XCode的文件夹，文件夹下有一个make-project.sh的文件，该文件是官方用于创建Mac OS下使用的libx265库的XCode项目的脚本，因为我对cmake并不是太了解，网上又木有找到相关资料的情况下，只能参考着Mac OS项目创建一个iOS的项目来编译了。

文章图片
1.png 只需要在文件目录下使用终端运行脚本，就可以生成Mac OS使用的XCode项目。

chmod +x make-project.sh ./make-prioject.sh

然后按照着该项目包含的h、m文件，创建了一个拥有相同h、m文件的iOS项目，实际就是把x265 Mac OS项目中包含的h、m文件都复制到iOS项目下并引入iO
S项目，同时在对应的Target中，Copy Files增加x265.h和x265_config.h文件，这两个文件需要暴露出来，在使用的时候主要是使用x265.h这个文件的方法，同时增加一个.pch文件，在Mac OS项目Build Setting中的预编译宏设置中有一些参数的设置，iOS项目这边我把它搬到.pch这里来了，看参数意义感觉有一些是有用的，有一些应该没啥用，不过我也照搬过来了，最后只要准确参照着Mac OS项目的文件内容，它有的我们iOS项目也有，它没的我们也别加进来，那么这个iOS项目是能够编译成功的，编译的时候记得选择Generic iOS Device，并且在真机使用。

iOS中摄像头录像并输出x265视频

文章图片
2.png

iOS中摄像头录像并输出x265视频

文章图片
3.png

iOS中摄像头录像并输出x265视频

文章图片
4.png 将编译好的x265 library加入项目中我又创建了一个新的iOS项目，用于实验这个文章主题，然后我们就引入上面编译出来的x265 library到这个项目吧，直接拖进来，然后增加一个libstdc++的库引用，x265库用到的。

iOS中摄像头录像并输出x265视频

文章图片
5.png 图中的CameraOutpuxX265的项目就是我创建用于实验文章主题的iOS项目，时间问题，我就只是简单的创建了一个Single View Application的项目，XCode->New->iOS->Single View Application，然后拖入编译好的x265 library，图中框着的x265-iOS就是我编译好的x265库，然后Linked Frameworks and Libraries那增加一个libstdc++库，因为x265库是用c++编写的，基本的c++ std库要加上。
准备的东西都准备好了，开始具体的实现了...
从iOS设备的摄像头获取视频源数据如何获取摄像头的数据，这个文章有很多，我就贴贴代码，代码整个复制黏贴到新建的项目的ViewController.m下，然后在Main.Storyboard中的ViewController上增加一个View，这个View控件与ViewController.m中的viewCapture连接上，项目跑起来，应该就能看到摄像头拍摄的内容了。
ViewController.m

@interface ViewController () @property (nonatomic,weak) IBOutlet UIView *viewCapture; @property (nonatomic,strong) AVCaptureVideoPreviewLayer *captureVideoPreviewLayer; @property (nonatomic,strong) AVCaptureSession *captureSession; @property (nonatomic,strong) AVCaptureConnection *captureVideoConnection; ... @end@implementation ViewController- (void)viewDidLoad { [super viewDidLoad]; [self initCapture]; }-(void)viewDidAppear:(BOOL)animated { [super viewDidAppear:animated]; [self start]; }-(void)viewDidDisappear:(BOOL)animated { [super viewDidDisappear:animated]; [self stop]; }- (void)didReceiveMemoryWarning { [super didReceiveMemoryWarning]; // Dispose of any resources that can be recreated. }- (void)initCapture { self.captureSession = [[AVCaptureSession alloc] init]; AVCaptureDevice* inputDevice = [AVCaptureDevice defaultDeviceWithMediaType:AVMediaTypeVideo]; AVCaptureDeviceInput *captureInput = [AVCaptureDeviceInput deviceInputWithDevice:inputDevice error:nil]; [self.captureSession addInput:captureInput]; AVCaptureVideoDataOutput *captureOutput = [[AVCaptureVideoDataOutput alloc] init]; [captureOutput setAlwaysDiscardsLateVideoFrames:YES]; [captureOutput setSampleBufferDelegate:self queue:dispatch_get_global_queue(DISPATCH_QUEUE_PRIORITY_HIGH, 0)]; NSString* key = (NSString *)kCVPixelBufferPixelFormatTypeKey; NSNumber* value = https://www.it610.com/article/[NSNumber numberWithUnsignedInt:kCVPixelFormatType_420YpCbCr8BiPlanarVideoRange]; //Pixel Format NV12 NSDictionary *videoSettings = [NSDictionary dictionaryWithObject:value forKey:key]; [captureOutput setVideoSettings:videoSettings]; [self.captureSession setSessionPreset:AVCaptureSessionPreset352x288]; [self.captureSession addOutput:captureOutput]; [self setCaptureVideoConnection:[captureOutput connectionWithMediaType:AVMediaTypeVideo]]; [self setCaptureVideoPreviewLayer:[AVCaptureVideoPreviewLayer layerWithSession:self.captureSession]]; [self.captureVideoPreviewLayer setFrame:self.view.bounds]; [self.captureVideoPreviewLayer setVideoGravity:AVLayerVideoGravityResizeAspect]; [self.captureVideoPreviewLayer connection]; [self.viewCapture.layer addSublayer:self.captureVideoPreviewLayer]; }-(void)start { [self.captureSession startRunning]; }-(void)stop { [self.captureSession stopRunning]; }#pragma mark - AVCaptureVideoDataOutputSampleBufferDelegate- (void)captureOutput:(AVCaptureOutput *)captureOutput didOutputSampleBuffer:(CMSampleBufferRef)sampleBuffer fromConnection:(AVCaptureConnection *)connection { if (connection == self.captureVideoConnection) {CVPixelBufferRef imageBuffer = CMSampleBufferGetImageBuffer(sampleBuffer); if (CVPixelBufferLockBaseAddress(imageBuffer, 0) == kCVReturnSuccess) { //这里就是实时获取到的摄像头源数据 }CVPixelBufferUnlockBaseAddress(imageBuffer, 0); } } @end

将摄像头获取到的源数据转为x265可进行编码压缩的源数据在这里要说的是，x265这个库的使用，我是学习以下文章的
最简单的视频编码器：基于libx265（编码YUV为H.265）

啥是NV12，这个另外自己百度了，因为我不是专门搞视频的，开始也不知道，专门百度了下，当然你还可以Google、Bing、搜狗...反正容易点来说，iOS摄像头录下的每一帧原始数据，可以指定的几种类型当中，包括这种NV12的，这编文章使用的就是这种NV12
啥是yuv420，这个也继续自己搜索看看吧，简单来说就是x265 library这个库进行编码压缩原数据的时候，可输入处理的几种原数据中的一种，这编文章就选用这种格式了，因为我看到上面推荐的文章用的这个，其他我也不会...

接下来我们主要关注在上面ViewController.m代码的AVCaptureVideoDataOutputSampleBufferDelegate中：

#pragma mark - AVCaptureVideoDataOutputSampleBufferDelegate- (void)captureOutput:(AVCaptureOutput *)captureOutput didOutputSampleBuffer:(CMSampleBufferRef)sampleBuffer fromConnection:(AVCaptureConnection *)connection { if (connection == self.captureVideoConnection) {CVPixelBufferRef imageBuffer = CMSampleBufferGetImageBuffer(sampleBuffer); if (CVPixelBufferLockBaseAddress(imageBuffer, 0) == kCVReturnSuccess) { //这里就是实时获取到的摄像头源数据 }CVPixelBufferUnlockBaseAddress(imageBuffer, 0); } }

在获取到视频源数据之后，我们着手要做的就是把摄像头获取到的数据(NV12数据)转为x265库能进行编码压缩的源数据(yuv420)，从摄像头直接拿下来的数据是不能直接交给x265开源库来处理的，所以增加以下方法：

-(NSData*)convertYUV420FromNV12ImageBuffer:(CVPixelBufferRef)imageBuffer {UInt8 *bufferbasePtr = (UInt8 *)CVPixelBufferGetBaseAddress(imageBuffer); UInt8 *bufferPtr = (UInt8 *)CVPixelBufferGetBaseAddressOfPlane(imageBuffer,0); UInt8 *bufferPtr1 = (UInt8 *)CVPixelBufferGetBaseAddressOfPlane(imageBuffer,1); size_t buffeSize = CVPixelBufferGetDataSize(imageBuffer); size_t width = CVPixelBufferGetWidth(imageBuffer); size_t height = CVPixelBufferGetHeight(imageBuffer); size_t bytesPerRow = CVPixelBufferGetBytesPerRow(imageBuffer); size_t bytesrow0 = CVPixelBufferGetBytesPerRowOfPlane(imageBuffer,0); size_t bytesrow1= CVPixelBufferGetBytesPerRowOfPlane(imageBuffer,1); size_t bytesrow2 = CVPixelBufferGetBytesPerRowOfPlane(imageBuffer,2); size_t yuv420_len = sizeof(UInt8) * width * height * 3 / 2; UInt8 *yuv420_data = https://www.it610.com/article/malloc(yuv420_len); // buffer to store YUV with layout YYYYYYYYUUVV/* convert NV12 data to YUV420*/ UInt8 *pY = bufferPtr ; UInt8 *pUV = bufferPtr1; UInt8 *pU = yuv420_data + width * height; UInt8 *pV = pU + width * height / 4; for(int i = 0; i < height; i++) { memcpy(yuv420_data + i * width, pY + i * bytesrow0, width); } for(int j = 0; j < height / 2; j++) { for(int i =0; i < width / 2; i++) { *(pU++) = pUV[i << 1]; *(pV++) = pUV[(i << 1) + 1]; } pUV += bytesrow1; }NSData *yuv420Frame = [NSData dataWithBytes:yuv420_data length:yuv420_len]; free(yuv420_data); return yuv420Frame; }

可以看到，这个方法就是把从摄像头获取到imageBuffer转为yuv420格式的数据，并封装为一个iOS的NSData的方法。
好了，视频数据也准备好了，接下来我们来初始化x265库。
x265 初始化直接贴代码，增加两个属性，因为都不是OC对象，所以我给了他们assign修饰词，在ViewController的delloc方法中，记得要对其进行release操作，主要关注x265Param和x265Encoder，x265Param时设置编码的一些参数的，更具体的可以进到x265Param的头文件看看属性说明，x265Encoder就是我们的编码器啦，下面是具体初始化编码器的我方法，东西不多，直接看代码就能明白。

其实这些代码一直都是ViewController.m文件下的代码，只是为了关注我说明的内容，我贴出来的都是删除掉了上面说到其他内容的代码的

@interface ViewController ()... @property (strong) NSMutableArray *yuv420Frames; //用于保存从摄像头获取到并转为yuv420的数组 @property (strong) NSMutableData *dataX265; //用于保存从yuv420数据转为x265视频数据@property (nonatomic,assign) x265_param *x265Param; @property (nonatomic,assign) x265_encoder *x265Encoder; @end@implementation ViewController- (void)viewDidLoad { [super viewDidLoad]; [self initData]; ... }-(void)initData {[self setYuv420Frames:[NSMutableArray array]]; [self setDataX265:[NSMutableData data]]; //根据视频实际分辨率设定 int width = 352; int height = 288; self.x265Param = x265_param_alloc(); x265_param_default(self.x265Param); self.x265Param->bRepeatHeaders = 1; self.x265Param->internalCsp = X265_CSP_I420; //输入yuv420格式 self.x265Param->sourceWidth = width; self.x265Param->sourceHeight = height; self.x265Param->fpsNum = 18; self.x265Param->fpsDenom = 1; self.x265Encoder = x265_encoder_open(self.x265Param); }-(void)dealloc {if (self.x265Encoder) { x265_encoder_close(self.x265Encoder); }if (self.x265Param) { x265_param_free(self.x265Param); } }... @end

使用x265进行转码继续贴代码，这个方法就是把yuv420的数据通过x265Encoder进行转码，并且appendData到self.dataX265中，实际流程上来说摄像头获取到的数据->转yuv420数据->转x265数据，方法如下：

-(void)encodeX265FromYuv420Frame:(NSData*)yuv420Frame {UInt8 *yuv420_buf = (UInt8*)yuv420Frame.bytes; size_t yuv420_len = yuv420Frame.length; //encode x265 x265_picture *x265Pic = NULL; char *x265PicBuf = NULL; int width = self.x265Param->sourceWidth; int height = self.x265Param->sourceHeight; int pixeSize = width * height; x265Pic = x265_picture_alloc(); x265_picture_init(self.x265Param, x265Pic); x265PicBuf = malloc(sizeof(char) * pixeSize * 3 / 2); x265Pic->planes[0] = x265PicBuf; x265Pic->planes[1] = x265PicBuf + pixeSize; x265Pic->planes[2] = x265PicBuf + pixeSize * 5 / 4; x265Pic->stride[0] = width; x265Pic->stride[1] = width / 2; x265Pic->stride[2] = width / 2; memcpy(x265Pic->planes[0], yuv420_buf, pixeSize); memcpy(x265Pic->planes[1], yuv420_buf + pixeSize, pixeSize / 4); memcpy(x265Pic->planes[2], yuv420_buf + pixeSize * 5 / 4, pixeSize / 4); x265_nal *x265NalPp = NULL; uint32_t x265NalPi = 0; x265_encoder_encode(self.x265Encoder, &x265NalPp, &x265NalPi, x265Pic, NULL); for (int i = 0; i < x265NalPi; i++) {uint8_t* payload = x265NalPp[i].payload; uint32_t sizeBytes = x265NalPp[i].sizeBytes; [self.dataX265 appendBytes:payload length:sizeBytes]; }x265_encoder_encode(self.x265Encoder, &x265NalPp, &x265NalPi, NULL, NULL); for (int i = 0; i < x265NalPi; i++) {uint8_t* payload = x265NalPp[i].payload; uint32_t sizeBytes = x265NalPp[i].sizeBytes; [self.dataX265 appendBytes:payload length:sizeBytes]; }x265_picture_free(x265Pic); free(x265PicBuf); }

转出来的就是x265的每一帧的数据啦，把数据拼接起来输出保存为文件，就是我们的x265视频了。
这个实验项目我上传到了Github，下面是地址：
https://github.com/ljx09195117/CameraOutoutX265-iOS
【iOS中摄像头录像并输出x265视频】x265的编译项目我就不上传了，希望能找到更标准的编译方法，这次就写这么多吧~

推荐阅读

上一篇：iQOO君临天下！骁龙855+三摄+全新配色，强势登陆NBA全明星！

下一篇：女人，你怎么那么喜欢逛街啊?