Abstract
Developing non-contact, non-destructive monitoring methods for marine life is crucial for sustainable resource management. Recent advances in monitoring technologies and machine learning have improved the acquisition and analysis of underwater image and acoustic data. Systems that obtain 3D acoustic data from beneath the seafloor are being developed; however, manually analyzing large 3D datasets is challenging, so an automatic method for analyzing benthic resource distribution is needed. In this study, we developed a system that non-destructively estimates benthic resource distribution by combining high-precision habitat data acquisition using high-frequency ultrasonic waves with prediction models based on a 3D convolutional neural network (3D-CNN). The system was used to estimate the distribution of asari clams (Ruditapes philippinarum) in Lake Hamana, Japan. Clam presence and clam count in a voxel were estimated with an ROC-AUC of 0.9 and a macro-average ROC-AUC of 0.8, respectively. The system visualized the clam distribution and estimated clam numbers, demonstrating its effectiveness for quantifying marine resources beneath the seafloor.
Introduction
The seafloor harbors numerous benthic organisms, which are indispensable as marine resources targeted by fisheries and for marine ecosystems and material cycles1,2. However, because most benthic organisms remain concealed in sediments, determining their population size and observing their behavior are difficult. Surveys are inevitably destructive, time-consuming, and costly, and conventional sampling methods cannot cover large areas, making it difficult to detect changes over time. This limitation impedes the sustainable management of sub-benthic resources and the environment.
Benthic organisms such as the asari clam (Japanese littleneck clam, Ruditapes philippinarum) have become a concern for stock management3. The asari clam is a bivalve that inhabits the shallows of inner bays and is an important target fishery species. The harvest of asari clams in Japan has trended downward, decreasing sharply from 160,000 tons in the 1980s to less than 10,000 tons since 2016. Various causes have been identified for this trend, including overfishing, predation, disease and pest damage, and the prey environment4. Clams have recently been monitored with the aim of sustainable fisheries management and resource conservation; however, this monitoring relies on manual digging and counting5. The current methods are very costly, and comprehensively tracking the dynamics of the asari clam ecosystem over time is difficult.
Various monitoring methods have recently been developed by linking technological improvements in image and acoustic data acquisition with machine learning analysis, including deep learning6,7,8,9,10. For example, two-dimensional (2D) imaging of tracer particles11 and three-dimensional (3D) computed tomography (CT) imaging12 have been developed as data acquisition methods for the subseafloor. Acoustic systems with various operating frequencies are commonly used to detect objects buried in seafloor sediments. For instance, wooden wrecks buried on the seafloor can be visualized using chirp signals with sweep pulses ranging from 1.5 to 13 kHz13. Recently, a new monitoring tool, a 3D acoustic coring system, has been developed and used to precisely survey buried plant roots with an outer diameter of 3 to 5 cm using ultrasonic waves with a center frequency of 100 kHz14. In addition, high-frequency signals with a center frequency of 1 MHz have recently been used to survey small creatures of 3–5 cm, such as clams15. Deep learning analysis methods have also been proposed for monitoring seafloor data, such as corals and seaweeds6,7,8, and attempts have been made to classify organisms beneath the seafloor in the laboratory from data acquired with the 3D acoustic coring system9. However, for the practical use of 3D acoustics, such analysis methods must be verified outside the laboratory, both to estimate the distribution of targets in the acquired 3D data and to validate the methods for benthic resource management.
In this study, we developed a system to estimate the distribution of asari clams in a non-contact and non-destructive manner. Our system first acquires 3D benthic data containing clams using the 3D acoustic coring system. The system then predicts whether, and how many, clams are present in a local voxel region using a 3D convolutional neural network (3D-CNN)16, a method that has been used successfully in a wide range of tasks involving 3D data10,17,18,19,20,21,22. The proposed system can also estimate the distribution and number of targets by integrating the prediction results over multiple voxels within a region. To validate the proposed system, we prepared benthic samples containing organisms such as clams and mussels and verified the estimations against them. We also report examples of distribution visualization and count estimation within a region. Furthermore, we show that Gradient-weighted Class Activation Mapping (Grad-CAM)23, a visual explanation method for interpreting neural network predictions, can be used to analyze and interpret the regions that are important for predictions on 3D data.
Results and discussion
Overview
The workflow of this study is shown in Fig. 1. The proposed system was divided into two parts: 1) preparation of input data for the deep learning model and 2) prediction by two 3D-CNN models. The 3D-CNN models predicted the presence or absence of clams and the count of clams in the local voxel data.
The workflow of the proposed system. This system is divided into two parts: 1) preparation of input data for the deep learning model and 2) prediction using the deep learning model. In 1), the A-core-200024 was first used to measure the reflected waves from the subseafloor (a), extract the areas where clams are present (b), and create datasets on the presence/absence and the count of clams for training and evaluation of the deep learning model (c). In 2), the datasets created in (c) were used to train two independent 3D-CNN (3D convolutional neural network) models (d), and finally, the 3D-CNN models were evaluated using three evaluation metrics (e).
Model for classifying the presence or absence of clams
First, we constructed a 3D-CNN model to discriminate the presence or absence of clams in the 3D data measured with the A-core-2000. Model training and evaluation were conducted using Stratified Group 5-fold cross-validation on the aforementioned data types A and C, together with data type M (mussels and sand) and data type AM (clams, mussels, and others) (Fig. 2(a)). Figure 2(b) and (c) show the model’s performance as an ROC curve and a confusion matrix. The prediction achieved an ROC-AUC of 0.90, an accuracy of 0.87, and an F1 score of 0.87. The detailed predictions for each bucket are shown in Fig. 2(d). Data type A, containing only clams, consistently exhibited high predictive performance, with accuracy exceeding 0.8 across all buckets. In contrast, AM and M showed several buckets with accuracy below 0.8, indicating lower overall accuracy than A. Most of the C data also showed high accuracy, except for C1, which fell below 0.8. For comparison, predictions using only the intensity information of each voxel, without the 3D-CNN, were also performed. Supplementary Fig. 1(a) and (b) show the classification results using the mean and maximum intensity values of each voxel, respectively, with very low ROC-AUC values of 0.51 and 0.53. These results indicate that the presence or absence of clams can be estimated with a certain level of accuracy using the 3D-CNN, whereas simple intensity statistics cannot.
Prediction performances of the model for predicting the presence or absence of clams in a voxel. The model was trained and evaluated using Dataset 1 (presence or absence of clams). (a) Predicted distribution examples for each data type: control data containing sand only (C), data containing asari clams and sand (A), data containing a mixture of mussels and sand (M), and data containing a mixture of clams, mussels, and other materials (AM). Note that the presence or absence of clams is predicted from the corresponding voxel data, not from these photo images. The prediction is performed for the voxel corresponding to each square on these images. Regions marked with × indicate failed predictions, and regions marked with ✔ indicate successful predictions. (b) and (c) The prediction accuracy and the ROC curve for each voxel. (d) The prediction accuracy for each bucket.
We used Grad-CAM23 to analyze where the prediction model focuses in a 3D acoustic image when determining the presence or absence of clams. Grad-CAM visualizes the regions that form the basis of the model’s predictions. Figure 3 shows the original images and the Grad-CAM visualizations averaged every 10 pixels in the z-direction. The upper rows of Fig. 3 show examples of successful predictions visualized by Grad-CAM: (a) and (b) are cases in which clams are present in the voxel, and (c) and (d) are cases in which they are absent. Figure 3(a) and (b) show that when the model predicted the presence of clams, it responded strongly to clam-like signals in relatively deep areas. In contrast, the model focused on shallow areas when predicting absence. The lower row of Fig. 3 shows examples of failed predictions: (e) and (f) are cases in which clams were predicted to be present even though there were none. Relatively strong signals were observed in these acoustic images; the model responded to them and predicted presence. As observed in (i) and (j), even the C data, which should have contained only sand, included small clams and stones, which may have adversely affected the prediction. In (g) and (h), clam-like signals were present in the acoustic images and attracted the model’s attention to some extent, yet the predictions were still incorrect. These results suggest that, although there remains room for improvement in accuracy, the prediction model bases its presence predictions on signals that appear to be clams.
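The Grad-CAM computation itself is compact once the feature maps and gradients of the last convolutional layer are available. The following NumPy sketch is our own minimal illustration (the function name and array shapes are assumptions, not the authors' implementation): the class-discriminative heatmap is the rectified, gradient-weighted sum of the 3D feature maps.

```python
import numpy as np

def grad_cam_3d(activations, gradients):
    """Grad-CAM heatmap for a 3D feature volume (illustrative sketch).

    activations: (D, H, W, K) feature maps of the last conv layer
    gradients:   (D, H, W, K) gradients of the class score w.r.t. those maps
    Returns a (D, H, W) heatmap, rectified and scaled to [0, 1].
    """
    # Channel weights: global average pooling of the gradients
    alpha = gradients.mean(axis=(0, 1, 2))                    # shape (K,)
    # Weighted combination of the feature maps, followed by ReLU
    cam = np.maximum((activations * alpha).sum(axis=-1), 0.0)
    if cam.max() > 0:                                         # normalise for display
        cam /= cam.max()
    return cam
```

In a framework such as TensorFlow, `activations` and `gradients` would be obtained with a gradient tape on the model's last convolutional layer; the resulting volume can then be averaged every 10 pixels in the z-direction for display, as in Fig. 3.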
Visualized attention by the prediction model using Grad-CAM. The visualized data were averaged every 10 pixels in the z-direction. Pred indicates predicted results; GT indicates Ground Truth. Data type: control data containing sand only (C), data containing asari clams and sand (A), data containing a mixture of mussels and sand (M), and data containing a mixture of clams, mussels, and other materials (AM). (a) to (h) The original input data visualized in grayscale (left) and the corresponding heatmap calculated by Grad-CAM (right). (a) A data where both prediction and actual were “presence.” (b) AM data where both prediction and actual were “presence.” (c) C data where both prediction and actual were “absence.” (d) C data where both prediction and actual were “absence.” (e) C data where the actual was “absence” but predicted as “presence.” (f) M data where the actual was “absence” but predicted as “presence.” (g) A data where the actual was “presence” but predicted as “absence.” (h) AM data where the actual was “presence” but predicted as “absence.” (i) Excavation scene for C data where the actual was “absence” but predicted as “presence.” (j) Excavation scene for M data where the actual was “absence” but predicted as “presence”.
Model for classifying the count of clams
Next, the model for estimating the count of clams was trained and evaluated, again using Stratified Group 5-fold cross-validation. The output was a 3-class classification: “0”, “1”, and “2 or more.” Figure 4(a) shows prediction examples for each data type, and Fig. 4(b) and (c) show the model’s performance as ROC curves and a confusion matrix. The prediction achieved a macro-average ROC-AUC of 0.90, an accuracy of 0.64, and a macro-average F1 score of 0.64. The confusion matrix indicates that the discrimination accuracy for “1” and “2 or more” was lower than that for “0”. This is probably because predicting the count becomes difficult when reflections overlap at high clam densities. The per-bucket accuracy is shown in Fig. 4(d). The accuracy for data type C, which contained only sand, was high, whereas that for data types A and M was slightly lower. The accuracy for data type AM, which contained various objects other than clams, was further reduced. This result suggests that small stones or clams may make classification more difficult, as discussed for the presence/absence model.
Prediction results of the model for estimating the clams count. The model was trained using Dataset 2 (labels: 0 clams, 1 clam, and 2 or more clams). (a) Examples of predictions for each data type: control data containing sand only (C), data containing asari clams and sand (A), data containing a mixture of mussels and sand (M), and data containing a mixture of clams, mussels, and other materials (AM). Regions marked with × indicate failed predictions, and regions marked with ✔ indicate successful predictions. (b) and (c) The prediction model’s performance for each voxel regarding ROC curves and confusion matrix. (d) The prediction accuracy for each bucket.
Estimating the distribution of clams
Because the trained models predict the presence and count of clams in each voxel, the proposed method enabled us to estimate the distribution of clams (Figs. 2(a) and 4(a)). Furthermore, the count of clams within a region containing multiple voxels was estimated by integrating the per-voxel predictions. Here, we evaluated the accuracy of estimating the clams contained within each bucket’s region, consisting of 20 voxels (Fig. 5). The estimated and measured counts for each bucket are listed in Supplementary Table 1. The overall correlation coefficient was 0.92, and the correlation coefficient for the A and AM data alone was 0.68, confirming a correlation between estimated and measured clam counts. The mean absolute error (MAE) and mean relative error (MRE) for each data type are presented in Table 1. The MRE was 0.20 for A and 0.12 for AM, indicating that estimation is possible with an error of approximately 10–20%. These results suggest that the count of clams in each region can also be estimated, albeit with some error.
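The integration step can be sketched as follows. This is our own illustration, not the authors' code: per-voxel class predictions are summed into a bucket estimate (with the "2 or more" class counted as 2, a simplifying assumption), and MAE/MRE follow directly.

```python
import numpy as np

def bucket_estimate(voxel_counts):
    """Integrate per-voxel count predictions (0, 1, or 2 for '2 or more')
    into a single count estimate for the bucket region."""
    return int(np.sum(voxel_counts))

def mae_mre(estimated, measured):
    """Mean absolute error and mean relative error over buckets.
    MRE assumes non-zero measured counts (i.e., A and AM buckets)."""
    est = np.asarray(estimated, dtype=float)
    meas = np.asarray(measured, dtype=float)
    mae = np.mean(np.abs(est - meas))
    mre = np.mean(np.abs(est - meas) / meas)
    return mae, mre
```

For example, a bucket whose 20 voxels are predicted as mostly 0 with a few 1s and 2s yields the summed estimate, which is then compared against the manually dug-up count.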
Estimated and actual count of clams per local region (bucket). The count of clams per bucket was calculated by integrating the results predicted by the model to estimate the count of clams. Data type: control data containing sand only (C), data containing asari clams and sand (A), data containing a mixture of mussels and sand (M), and data containing a mixture of clams, mussels, and other materials (AM).
Conclusion
In this study, we prepared and measured benthic data, including clams, to validate a resource survey method based on the three-dimensional acoustic coring system. Using the measured data, two 3D-CNN prediction models were trained and evaluated to estimate the presence and count of clams. We successfully estimated the presence or absence of clams in each voxel with an ROC-AUC of approximately 0.9 and the count of clams in local areas with a macro-average ROC-AUC of 0.8. The count of clams within a certain area (bucket) was estimated with an MRE of 0.12 or 0.20. These results indicate the potential of the proposed method for estimating benthic resources.
Unlike conventional sampling methods that involve digging and disturbance of the marine environment, this method requires no physical interference with the ecosystem, thus minimizing potential harm to other marine species and maintaining habitat integrity. This reduced environmental impact is a significant advantage over traditional methods and makes the approach a more sustainable option for resource surveys. Additionally, while traditional methods often have difficulty capturing spatial distribution and temporal changes, the approach proposed in this study allows both to be observed.
However, this study leaves room for improvement. First, the prediction accuracy tended to decrease when objects other than clams were introduced, as in the AM data. To address this, it would be beneficial to prepare a larger amount of varied data and to use more advanced neural networks, such as vision transformers, to process the voxel data. Second, because the A-core-2000 measures slowly and covers a relatively narrow area, considerably faster measuring devices (e.g., an array sonar system) are needed. Finally, as this study was conducted in a relatively controlled environment, verification in the open field is desired in the future.
Methods
Data preparation and measurement using the acoustic coring system
Preparation of benthic data containing clams
Benthic data, including clams, were obtained from a portable pool at the Hamanako Branch, Shizuoka Prefectural Research Institute of Fishery, Shizuoka Prefecture, Japan. Four types of benthic data were prepared to verify whether estimation could be performed under various conditions: control data containing sand only (C), data containing asari clams and sand (A), data containing a mixture of mussels and sand (M), and data containing a mixture of clams, mussels, and other materials (AM). Each benthic dataset was prepared by placing sand and mussels from Lake Hamana in a rectangular bucket with internal dimensions of 580 × 155 × 200 mm; six buckets were prepared for each data type. For the A and AM data, 40 live clams from nearby Mikawa Bay were placed in each bucket three days before the measurement date and kept there until measurement.
Measurement using the acoustic coring system
A convergent ultrasonic sensor, the A-core-200024, was used to observe the benthic data in the prepared buckets (Fig. 1(a)). A 250 × 200 mm area of each bucket was scanned at 2 mm intervals while being continuously irradiated with ultrasound, and the reflected sound waves were measured and recorded. A square pulse with a central frequency of 500 kHz was generated by a pulser-receiver in the acoustic unit and applied to the focus probe. The pulse repetition interval was 200 ms. The focal distance, beam width, and focal depth were approximately 70 mm, 4 mm, and 30 mm, respectively. The median grain size of the sand used in this study was 2.6 × 10−4 m; thus, the attenuation coefficient was estimated to be 102 dB/m according to the analysis in the previous study24. Absorption attenuation was considered dominant because the ratio kd (k = wavenumber, d = grain size) was 8.6 × 10−2, which is quite small (kd ≪ 1)25. Envelope processing was performed on the recorded waveform data. The acoustic image constructed by the Viewer24 showed that the reflection from the clams lay between the first reflection from the sediment surface and its second (multiple) reflection (the 2nd reflection in Fig. 1(b)). The range of analysis was therefore defined as the area between the first and second reflections from the sediment surface. For each z-coordinate in the acquired 3D reflection-intensity data, the total reflection intensity in the XY plane was calculated, the region of clam presence was extracted, and the size was standardized for each bucket (Sl). As a result, 3D reflection-intensity data measured at 125 × 100 × 693 points in each bucket were used in this study.
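Envelope processing of this kind is typically done by taking the magnitude of the analytic signal. The following NumPy sketch is our own illustration of that standard technique (the actual processing chain of the A-core-2000 software is not specified here):

```python
import numpy as np

def envelope(waveform):
    """Amplitude envelope of a recorded A-scan via the FFT-based
    analytic signal (equivalent to scipy.signal.hilbert)."""
    n = len(waveform)
    spectrum = np.fft.fft(waveform)
    # Build the frequency-domain filter that zeroes negative frequencies
    # and doubles positive ones, yielding the analytic signal on inversion.
    h = np.zeros(n)
    h[0] = 1.0
    if n % 2 == 0:
        h[n // 2] = 1.0
        h[1:n // 2] = 2.0
    else:
        h[1:(n + 1) // 2] = 2.0
    analytic = np.fft.ifft(spectrum * h)
    return np.abs(analytic)
```

Applied to each recorded waveform, this converts the oscillating echo into a smooth intensity trace from which the reflection peaks (sediment surface, clams, multiple reflection) can be located.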
Manual identification of positions of asari clams
The positions of the asari clams in the measured data were identified to evaluate the prediction models. First, the x and y coordinates, i.e., the candidate horizontal positions of the clams, were manually obtained from the reflection intensity data. Next, the positions were confirmed by comparison with photographs of the dug-up clams taken at the time of measurement. The backscatter of each clam was measured, and the center of the backscatter was defined as the clam position. The clam positions are recorded in Supplementary Tables 2 and 3. The data from buckets A5, C4, and AM5 were excluded from the dataset because mechanical issues with the probe introduced excessive noise, rendering those measurements inaccurate.
Preparation of datasets for model training and evaluation
Datasets 1 and 2 were created to construct models that predict, respectively, the presence or absence of clams and the count of clams in a local voxel. First, the data obtained from each bucket were divided into 25 × 25 pixel windows, shifted by 1 pixel in the horizontal direction, resulting in 124,689 local voxel data, each of size 25 × 25 × 693 pixels (Fig. 1(b)). Because the backscatter of a clam could straddle the boundary of the divided data, a local voxel was regarded as including a clam if the clam’s recorded position fell inside the voxel or if its shortest distance from the voxel was less than 11 pixels. Based on the positions of clams identified in the previous section, Dataset 1 assigned the label “Absence” if a local voxel contained no clams and “Presence” if it contained at least one. The number of labeled data is presented in Table 2; for C and M, all data were labeled “Absence” because those buckets contained no clams. For Dataset 2, the count of clams was assigned as the label (Table 3). Because very few local voxels contained three or more clams (1,137), such data were treated as containing two or more. Datasets 1 and 2 were then created by randomly sampling from the 124,689 local voxel data so that the number of data for each label was approximately equal.
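As an illustrative sketch of this windowing and labeling step (the function names and the exact distance convention at the window edge are our assumptions, not the authors' code), each local voxel is cut from the bucket volume and labeled by how many recorded clam positions lie inside it or within 11 pixels of it:

```python
import numpy as np

def extract_voxels(volume, win=25, stride=1):
    """Slide a win x win window over the XY plane of an (X, Y, Z) intensity
    volume, yielding each local voxel of shape (win, win, Z) with its origin."""
    nx, ny, _ = volume.shape
    for x in range(0, nx - win + 1, stride):
        for y in range(0, ny - win + 1, stride):
            yield (x, y), volume[x:x + win, y:y + win, :]

def count_clams(origin, clam_xy, win=25, margin=11):
    """Number of clams assigned to the voxel at `origin`: a clam counts if its
    recorded (x, y) position lies inside the window or less than `margin`
    pixels away from it (to catch backscatter straddling the boundary)."""
    x0, y0 = origin
    total = 0
    for cx, cy in clam_xy:
        dx = max(x0 - cx, 0, cx - (x0 + win - 1))   # shortest distance in x
        dy = max(y0 - cy, 0, cy - (y0 + win - 1))   # shortest distance in y
        if np.hypot(dx, dy) < margin:
            total += 1
    return total
```

Dataset 1 then labels a voxel “Presence” when `count_clams(...) >= 1`, and Dataset 2 clips the count at 2 for the “2 or more” class.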
Classification using deep neural networks
In this study, we employed a 3D-CNN16 to develop the prediction models. 3D-CNNs have demonstrated success in various tasks involving 3D data, such as the classification, detection, and segmentation of medical images10,17,18,19,20, as well as action recognition21,22. A 3D-CNN mainly consists of convolutional, pooling, and fully connected layers. We modified the 3D-CNN developed in our previous study9 to build two independent models: one classifying the presence or absence of clams and another classifying the count of clams. The presence/absence model took Dataset 1 as input and output either ‘Absence’ or ‘Presence’ (Fig. 6); it was trained with the Adam optimizer (batch size 3, learning rate 10−6) for a maximum of 50 epochs, stopping early if there was no improvement for 10 epochs. The count model took Dataset 2 as input and output ‘0’, ‘1’, or ‘2 or more’ (Fig. 6); it was trained with the same optimizer settings for a maximum of 30 epochs, with the same early-stopping criterion.
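The convolutional building block at the heart of such a model can be sketched in plain NumPy. The following single-channel “valid” 3D convolution (cross-correlation, as is conventional in CNNs) is our own minimal illustration of what each convolutional layer computes, not the authors' implementation:

```python
import numpy as np

def conv3d_valid(volume, kernel):
    """Single-channel 3D cross-correlation with 'valid' padding,
    the core operation of every convolutional layer in a 3D-CNN."""
    D, H, W = volume.shape
    d, h, w = kernel.shape
    out = np.zeros((D - d + 1, H - h + 1, W - w + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            for k in range(out.shape[2]):
                # Inner product of the kernel with the local 3D neighbourhood
                out[i, j, k] = np.sum(volume[i:i + d, j:j + h, k:k + w] * kernel)
    return out
```

In the actual models this operation is provided by the framework (e.g., `tf.keras.layers.Conv3D`), stacked with pooling and fully connected layers and trained with Adam as described above.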
In this study, TensorFlow (version 2.4.1)26 was used to construct the two 3D-CNN models, and model performance was evaluated using Stratified Group 5-fold cross-validation27, with each bucket treated as a group. The dataset was divided into training (60%), validation (20%), and test (20%) sets.
Overview of predictions using the 3D-CNN (3D convolutional neural network). The 3D-CNN models predict (a) the presence/absence of clams and (b) the count of clams (0, 1, or “2 or more”) from input 3D acoustic data. For training and evaluation of the models, Datasets 1 and 2, shown in Fig. 1, were used for predictions (a) and (b), respectively.
Evaluation metrics of the prediction model
Three evaluation metrics were employed to evaluate the trained models: accuracy, F1-score, and the area under the receiver operating characteristic curve (ROC-AUC). Accuracy represents the proportion of correctly predicted data and is the most basic measure of a classification model’s overall performance. The F1-score is defined as the harmonic mean of precision and recall:
F1 = 2 × (precision × recall) / (precision + recall)
Precision indicates the percentage of data predicted to be positive that are actually positive, and recall indicates the percentage of actually positive data that are predicted to be positive. The F1-score was employed to evaluate the balance between precision and recall. ROC-AUC is the area under the ROC curve, which plots the true-positive rate against the false-positive rate as the classification threshold varies. It was employed to evaluate how well the model detects the presence of clams while tolerating certain errors, balancing true-positive and false-positive rates across thresholds. Each metric ranges from 0 to 1, with higher values indicating better performance.
Furthermore, to evaluate the model classifying the count of clams, we used the macro-average ROC-AUC and F1-score, computed by calculating each metric for each class individually and then averaging the per-class scores. The macro-average weighs every class equally, allowing a balanced evaluation of overall performance even under class imbalance.
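With scikit-learn, the macro-averaged scores for the three count classes reduce to two calls. The probability table below is a hypothetical example for six voxels, not data from the study:

```python
import numpy as np
from sklearn.metrics import f1_score, roc_auc_score

# Hypothetical 3-class predictions (0, 1, "2 or more") for six voxels
y_true = np.array([0, 0, 1, 1, 2, 2])
proba = np.array([
    [0.8, 0.1, 0.1],
    [0.6, 0.3, 0.1],
    [0.2, 0.7, 0.1],
    [0.3, 0.4, 0.3],
    [0.1, 0.2, 0.7],
    [0.2, 0.3, 0.5],
])
y_pred = proba.argmax(axis=1)

# Macro-average: compute the metric per class, then average with equal weight
macro_f1 = f1_score(y_true, y_pred, average="macro")
macro_auc = roc_auc_score(y_true, proba, multi_class="ovr", average="macro")
```

`multi_class="ovr"` computes one ROC-AUC per class in a one-vs-rest fashion before averaging, matching the per-class curves shown in Fig. 4(b).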
Data availability
The acoustic voxel data have been archived on Zenodo and can be accessed via the following links: https://zenodo.org/records/13381836, https://zenodo.org/records/13377864, https://zenodo.org/records/13381893.
References
Solan, M. et al. Extinction and ecosystem function in the marine benthos. Science 306, 1177–1180 (2004).
Danise, S., Twitchett, R. J., Little, C. T. S. & Clémence, M. E. The impact of global warming and anoxia on marine benthic community dynamics: an example from the Toarcian (Early Jurassic). PLoS One 8, e56255 (2013).
Ito, H. What kind of animal is the clam Ruditapes philippinarum? –Introduction to its ecology and fishery. Asari to wa donna ikimono ka: Asari no seitai, oyobi gyogyou seisan no suii (in Japanese). Jpn J. Benthol. 57, 134–138 (2002).
Toba, M. Revisiting recent decades of conflicting discussions on the decrease of Asari clam Ruditapes philippinarum in Japan: A review. Asari shigen no genshou ni kansuru giron e no saihou (in Japanese). Nippon Suisan Gakkaishi. 83, 914–941 (2017).
Murai, M. Trends and considerations on the variation in the number of clams at ‘Umi no Kouen.’ 「Umi no Kouen」 ni okeru Asari kotaisuu no hendou ni kansuru keikou to kousatsu (in Japanese). Enkangiki Gakkaishi (Journal Coastal. Zone Studies). 32, 19–30 (2019).
Wang, S. et al. An efficient segmentation method based on semi-supervised learning for seafloor monitoring in Pujada Bay, Philippines. Ecol. Inf. 78, (2023).
Terayama, K. et al. Cost-effective seafloor habitat mapping using a portable speedy sea scanner and deep-learning-based segmentation: A sea trial at Pujada Bay, Philippines. Methods Ecol. Evol. 13, 339–345 (2022).
Mizuno, K. et al. An efficient coral survey method based on a large-scale 3-D structure model obtained by Speedy Sea Scanner and U-Net segmentation. Sci. Rep. 10, (2020).
Mizuno, K., Terayama, K., Ishida, S., Godbold, J. A. & Solan, M. Combining three-dimensional acoustic coring and a convolutional neural network to quantify species contributions to benthic ecosystems. R Soc. Open. Sci. 11, (2024).
Gu, Y. et al. Automatic lung nodule detection using a 3D deep convolutional neural network combined with a multi-scale prediction strategy in chest CTs. Comput. Biol. Med. 103, 220–231 (2018).
Solan, M. et al. In situ quantification of bioturbation using time-lapse fluorescent sediment profile imaging (f-SPI), luminophore tracers and model simulation. Mar. Ecol. Prog Ser. 271, 1–12 (2004).
Hale, R. et al. High-resolution computed tomography reconstructions of invertebrate burrow systems. Sci. Data 2, (2015).
Plets, R. M. K. et al. The use of a high-resolution 3D Chirp sub-bottom profiler for the reconstruction of the shallow water archaeological site of the Grace Dieu (1439), River Hamble, UK. J. Archaeol. Sci. 36, 408–418 (2009).
Mizuno, K. et al. Automatic non-destructive three-dimensional acoustic coring system for in situ detection of aquatic plant root under the water bottom. Case Stud. Nondestructive Test. Evaluation. 5, 1–8 (2016).
Suganuma, H., Mizuno, K. & Asada, A. Application of wavelet shrinkage to acoustic imaging of buried asari clams using high-frequency ultrasound. Jpn J. Appl. Phys. 57, 07LG08 (2018).
Ji, S., Xu, W., Yang, M. & Yu, K. 3D Convolutional Neural Networks for Human Action Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35, 221–231 (2013).
Kamnitsas, K. et al. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Med. Image Anal. 36, 61–78 (2017).
Zhou, J. et al. Weakly supervised 3D deep learning for breast cancer classification and localization of the lesions in MR images. J. Magn. Reson. Imaging. 50, 1144–1151 (2019).
Dou, Q., Chen, H., Yu, L., Qin, J. & Heng, P. A. Multilevel Contextual 3-D CNNs for False Positive Reduction in Pulmonary Nodule Detection. IEEE Trans. Biomed. Eng. 64, 1558–1567 (2017).
Yang, C., Rangarajan, A. & Ranka, S. Visual Explanations From Deep 3D Convolutional Neural Networks for Alzheimer’s Disease Classification. AMIA Annu. Symp. Proc. 2018, 1571–1580 (2018).
Molchanov, P. et al. Online detection and classification of dynamic hand gestures with recurrent 3D convolutional neural networks. in IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 4207–4215 (IEEE, 2016). https://doi.org/10.1109/CVPR.2016.456
Huang, J., Zhou, W., Li, H. & Li, W. Sign language recognition using 3D convolutional neural networks. in IEEE International Conference on Multimedia and Expo (ICME) 1–6 (IEEE, 2015). https://doi.org/10.1109/ICME.2015.7177428
Selvaraju, R. R. et al. Grad-CAM: visual explanations from deep networks via gradient-based localization. in IEEE International Conference on Computer Vision (ICCV) 618–626 (IEEE, 2017). https://doi.org/10.1109/ICCV.2017.74
Mizuno, K., Nomaki, H., Chen, C. & Seike, K. Deep-sea infauna with calcified exoskeletons imaged in situ using a new 3D acoustic coring system (A-core-2000). Sci. Rep. 12, (2022).
Mizuno, K., Cristini, P., Komatitsch, D. & Capdeville, Y. Numerical and Experimental Study of Wave Propagation in Water-Saturated Granular Media Using Effective Method Theories and a Full-Wave Numerical Simulation. IEEE J. Oceanic Eng. 45, 772–785 (2020).
Abadi, M. et al. TensorFlow: a system for large-scale machine learning. in 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16) 265–283 (USENIX Association, 2016).
Scikit-learn User Guide, 3.1. Cross-validation: evaluating estimator performance. Accessed July 24, (2024). https://scikit-learn.org/stable/modules/cross_validation.html
Acknowledgements
This work was supported by KAKENHI (20H02362, 20K15587, and 20KK0238).
Author information
Authors and Affiliations
Contributions
T. K.: data acquisition, data curation, formal analysis, investigation, software, methodology, writing—original draft; K.M.: conceptualization, data acquisition, data curation, investigation, methodology, project administration, resources, writing—review and editing; S.I.: methodology, resources, software, validation, writing—review and editing; S. O.: data acquisition, investigation; H. W.: data acquisition, investigation, conceptualization; Y. U.: data acquisition, investigation; Y. Saito: data acquisition, investigation; K. O.: data acquisition, investigation; S. S.: data acquisition, investigation; Y. Sugimoto: data acquisition, investigation; K.T.: formal analysis, funding acquisition, software, validation, writing—original draft, review and editing.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic Supplementary Material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kadoi, T., Mizuno, K., Ishida, S. et al. Development of a method for estimating asari clam distribution by combining three-dimensional acoustic coring system and deep neural network. Sci Rep 14, 26467 (2024). https://doi.org/10.1038/s41598-024-77893-7