I read an article about captioning videos and I want to use solution number 4 (extract features with a CNN, pass the sequence to a separate RNN) in my own project.
But for me, it seems really strange that in this method we use the Inception model without any retraining or something like that. Every project has different requirements and even if you use pretrained model instead of your own, you should do some training.
And I wonder how to do this? For example, I created a project where I use the network with CNN layers and then LSTM and Dense layers. And in every epoch, there is feed-forward and backpropagation through the whole network, all layers. But what if you have CNN network to extract features and LSTM network that takes sequences as inputs. How to train CNN network if there is no defined output? This network should only extract features but the network doesn't know what features. So the question is: How to train CNN to extract relevant features and then passing these features to LSTM?
相关知识
Extract features with CNN and pass as sequence to RNN
【深度学习论文翻译】Learning Spatiotemporal Features with 3D Convolutional Networks全文对照翻译
sequence 口袋妖怪 神奇宝贝
RNN
CATS CLAW EXTRACT CAS#:
基于深度神经网络的多模态情感识别
《自然:神经科学》论文:动物视觉系统里的RNN能加速物体识别
揭秘喵大人算法:宠物AI如何读懂你的爱宠心声?
使用深度学习进行语音情感识别:案例演示与代码实现
12
网址: Extract features with CNN and pass as sequence to RNN https://m.mcbbbk.com/newsview1346950.html
| 上一篇: 海淘狗狗狗粮,如何挑选最适合爱宠 |
下一篇: 宠物医疗事故赔偿标准是多少 |