Biophysical and structural studies of fibulin-2

Sohail, Anil A.; Koski, M. Kristian; Ruddock, Lloyd W.

doi:10.1038/s41598-024-64931-7

Article
Open access
Published: 02 July 2024

Scientific Reports volume 14, Article number: 15091 (2024)

Abstract

Fibulin-2 is a multidomain, disulfide-rich, homodimeric protein that belongs to a broader extracellular matrix family. It plays an important role in the development of elastic fiber structures. Malfunction of fibulin due to mutation or poor expression can result in a variety of diseases including synpolydactyly, limb abnormalities, eye disorders leading to blindness, cardiovascular diseases and cancer. Traditionally, fibulins have either been produced in mammalian cell systems or isolated from the extracellular matrix, a procedure that results in poor availability for structural and functional studies. Here, we produced seven fibulin-2 constructs covering 62% of the mature protein (749 out of 1195 residues) using a prokaryotic expression system. Biophysical studies confirm that the purified constructs are folded and that the presence of disulfide bonds within the constructs makes them extremely thermostable. In addition, we solved the first crystal structure for any fibulin isoform, a structure corresponding to the previously suggested three motifs related to anaphylatoxin. The structure reveals that the three anaphylatoxin moieties form a single-domain structure.

Introduction

Fibulin-2 is one of eight members of the fibulin family found in mammals¹. Fibulin-1, the first member of the fibulin family, was distinguished from other basement membrane proteins by having two distinct cysteine-rich regions². The first region has close homology to anaphylatoxin-like modules, while the second has homology to epidermal growth factor (EGF)-like domains³. Fibulin-2 was subsequently recognized as a member of the fibulin family^4,5 and shows approximately 45% sequence identity with fibulin-1^6,7,8,9. Fibulin-2 also contains a unique N-domain, which shows no sequence similarity with other extracellular matrix (ECM) proteins.

Mouse fibulin-2 has 83.7% identity with the human protein and can be divided into three regions, 1–3 (Fig. 1a)^4,5. The N-domain (V27-T434) can be further divided into two sub-domains, N_a (V27-C176) and N_b (H177-T434). N_a is a cysteine-rich sub-domain with 22 cysteines, whereas the N_b sub-domain lacks cysteines. The three anaphylatoxin-like modules (C435-C543) following the N-domain are also found in fibulin-1, but not in other fibulins. These modules have a total of 17 cysteines, which are predicted to form 8 disulfide bonds along with an unpaired cysteine (C500)^4,10. Region 2 of fibulin-2 comprises eleven EGF-like domains with 66 cysteines in total, each domain having 6 conserved cysteine residues (Fig. 1b)⁴. Two unassigned sequences (E544-D593 and P636-P668) are found before and after the first EGF-like domain (D594-R635). The third EGF-like domain (D709-V755) in the mouse protein is absent in human fibulin-2. Ten of the EGF-like domains of mouse fibulin-2 are reported to be calcium-binding domains (cEGF) with the consensus sequence D-x-D/N-E before the first cysteine (highlighted pink in Fig. 1b)¹¹. These residues are not found in the second EGF-like domain (Q669-E708). The C-terminal region known as Domain III (R1107-P1221) has two cysteines and is ubiquitous in the fibulin family¹².
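The calcium-binding consensus lends itself to a simple pattern search. The following is a minimal Python sketch, assuming that D-x-D/N-E is read as a four-residue pattern that must occur somewhere before the first cysteine of a domain. The function name and both example sequences are hypothetical and are not taken from fibulin-2.

```python
import re

# D-x-D/N-E in one-letter code: Asp, any residue, Asp or Asn, Glu.
CEGF_MOTIF = re.compile(r"D.[DN]E")


def has_cegf_consensus(domain_sequence: str) -> bool:
    """Return True if D-x-D/N-E occurs before the first cysteine of the domain."""
    first_cys = domain_sequence.find("C")
    # If the sequence has no cysteine at all, search the whole sequence.
    prefix = domain_sequence if first_cys == -1 else domain_sequence[:first_cys]
    return CEGF_MOTIF.search(prefix) is not None


# Hypothetical sequences for illustration only (not fibulin-2 residues).
print(has_cegf_consensus("DIDECESGAHNCA"))  # motif precedes the first cysteine
print(has_cegf_consensus("QPGTCRLNQDIDE"))  # motif occurs only after the first cysteine
```

A stricter reading of the consensus, for example requiring the motif to sit immediately before the first cysteine, would only need the pattern to be anchored at the end of the prefix.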

According to electron microscopy studies, the overall shape and size of full-length fibulin-2 differ from those of other fibulins¹³. This could be due to the presence of the unique N-domain of region 1 and/or to fibulin-2 being the only homodimeric fibulin isoform. It has been reported that the unpaired cysteine residue (C500), located in the second anaphylatoxin-like module of region 1, plays a role in stabilizing anti-parallel homodimer formation¹⁰. Additionally, non-covalent interactions between the N-domain and the region 2 EGF-like domains also assist in dimerization^4,10.

Complex formation between fibulin-2 and other ECM proteins, including aggrecan, versican, brevican¹⁴, fibrillin-1¹⁵, laminins^16,17, nidogens¹⁸, perlecan¹⁹ and tropoelastin^20,21, has been reported. These complexes play important roles in many biological processes. Fibulin expression is found during the development of elastic fiber structures such as cartilage, cardiac valves, and blood vessels⁸. Malfunction or low expression of fibulin may result in several pathological processes, including synpolydactyly and limb abnormalities²², eye disorders leading to blindness²³, cardiovascular diseases and cancer²⁴.

Currently, there are no crystal structures reported for any fibulin family members, but there is an unpublished nuclear magnetic resonance (NMR) structure for the EGF-like 1 domain of human fibulin-4 (PDB code 2KL7). This lack of structural information may arise due to the high level of post-translational modifications in the protein, in particular the high disulfide density. To our knowledge, here we report for the first time the soluble production of mouse fibulin-2 constructs using an Escherichia coli (E. coli) production system. The constructs made covered 62% of the mature protein (749 out of 1195 residues) and include 42 of the predicted 53 disulfide bonds in the full-length protein. Purified constructs were subjected to biophysical characterization. The first crystal structure within the fibulin family, of the three anaphylatoxin-like modules of mouse fibulin-2, was solved.

Results and discussion

Construction and production

Previously, we have successfully produced large and complex disulfide-bond containing ECM proteins using CyDisCo (cytoplasmic disulfide bond formation in E. coli) technology²⁵. This encouraged us to attempt to produce full-length mature fibulin-2 (V27-P1221). Unfortunately, the soluble production of the full-length construct was not successful in this system and therefore smaller constructs were made (Table 1), to try to identify the limiting factor(s) for their production.

Table 1 Plasmids expressing constructs of mouse fibulin-2 used in this study.

None of the constructs that included the N-domain of fibulin-2 (V27-G545, V27-D593 and V27-P1221) yielded soluble protein. AlphaFold prediction^26,27 suggests that the N_b sub-domain (H177-T434) of fibulin-2 is unstructured and disordered (supplementary Fig. S1a). From this we hypothesize that this region may interact with some other ECM protein(s) and that such an interaction may be required to stabilize the structure of the N-domain and hence allow soluble production. Constructs containing EGF-like domains were partially solubly expressed, resulting in purified yields in the range 0.1–1.0 mg/L. These production levels are significantly lower than the levels we have observed for other EGF-like domain-containing ECM proteins such as region 3 of perlecan²⁵. These differences in protein yield support the idea that protein expression in E. coli may be highly dependent on the exact nature of the protein of interest and/or on the nature of inter-domain packing in the native protein. In contrast to the low yields of the other constructs, the fibulin-2 construct S427-G545 (wild-type and C500L mutant), which has three anaphylatoxin-like modules, was fully solubly produced and was purified in good yields (>10 mg/L). This construct is predicted to be a disulfide-linked homodimer¹⁰ with a molecular weight of ~27.6 kDa and a total of 34 cysteines forming 17 disulfide bonds. Overall, the seven constructs (shown in Table 1) that could be produced as (partly or wholly) soluble proteins covered 62% of fibulin-2 and included 79.3% of the total disulfide bonds.
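Because neighbouring constructs overlap by a few residues, the residue coverage has to be counted on the union of the construct ranges rather than on the sum of their lengths. The sketch below is illustrative only: it takes the boundaries from the construct names used in this article (the table itself is not reproduced here) and merges overlapping ranges before counting.

```python
# Inclusive residue ranges (mouse fibulin-2 numbering) of the seven soluble
# constructs, read from the construct names used in the text.
CONSTRUCTS = {
    "S427-G545": (427, 545),
    "G592-Q710": (592, 710),
    "D709-V802": (709, 802),
    "T800-V896": (800, 896),
    "V894-V981": (894, 981),
    "E979-L1063": (979, 1063),
    "K1061-P1221": (1061, 1221),
}
MATURE = (27, 1221)  # mature protein, V27-P1221


def covered_residues(ranges):
    """Count residues covered by inclusive ranges, merging overlaps."""
    total = 0
    block_start, block_end = None, None
    for start, end in sorted(ranges):
        if block_end is None or start > block_end + 1:
            # Disjoint from the running block: bank it and start a new one.
            if block_end is not None:
                total += block_end - block_start + 1
            block_start, block_end = start, end
        else:
            block_end = max(block_end, end)
    if block_end is not None:
        total += block_end - block_start + 1
    return total


mature_length = MATURE[1] - MATURE[0] + 1
covered = covered_residues(CONSTRUCTS.values())
print(f"{covered} of {mature_length} residues covered")
```

With these boundaries the union consists of two blocks, S427-G545 and G592-P1221, which reproduces the 749 out of 1195 residues quoted for the mature protein.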

Biophysical studies

The apparent molecular weights of all the purified fibulin-2 constructs were analyzed using sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE). Both the reduced and the N-ethylmaleimide (NEM)-treated non-reduced purified samples were run on 15% SDS-PAGE gels (Fig. 2). All the EGF-like domain-containing constructs, except for E979-L1063, showed a single band under both reduced and non-reduced conditions (Fig. 2a). The presence of multiple bands for E979-L1063 suggests that the construct is prone to degradation and/or modification upon storage in SDS loading buffer, as samples run immediately after purification showed a single band. All the bands (except the degradation products of E979-L1063) ran at their expected molecular size in reducing SDS-PAGE, and the single band observed in the non-reduced NEM-treated samples implies that the constructs all have a single redox state. For the construct containing the anaphylatoxin-like modules (S427-G545), a single band near 16 kDa could be seen for the wild-type protein in the reduced state, with a shift in mobility in non-reduced SDS-PAGE indicative of inter-molecular disulfide-based dimerization (Fig. 2b). The apparent molecular weight of the dimer is not twice that of the monomer, probably due to the predicted intra-molecular disulfides in each subunit. To confirm this, the C500L mutation was made. The mutant ran at the same position as the wild-type protein in reducing SDS-PAGE, but at a lower molecular weight in non-reducing SDS-PAGE (Fig. 2b). This is consistent with it lacking the inter-molecular disulfide that keeps the protein as a homodimer in SDS-PAGE, while retaining the intra-molecular disulfides. This implies that the C500L mutation disrupts the formation of the inter-molecular disulfide bond in the dimer and that the wild-type protein is entirely in a disulfide-linked homodimer state.

To validate the proteins made and to examine their redox states, the exact molecular weights of the fibulin-2 constructs were determined by mass spectrometry (MS) (Table 2). The MS results confirmed that the purified fibulin-2 constructs have the expected molecular weights, with all the cysteines present in the constructs being in disulfide bonds. No significant protein adducts with an additional 125 Da of molecular weight (or multiples thereof) were seen for any of the NEM-treated samples. This further implies that all the cysteines in the constructs are involved in disulfide bonds. The molecular weights of the C500L mutant and the wild-type of the fibulin-2 construct containing three anaphylatoxin-like modules (S427-G545) indicated monomeric and disulfide-linked dimeric states, respectively.
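The reasoning behind this MS readout can be written as simple mass bookkeeping: each disulfide removes two hydrogens from the fully reduced mass, while each free thiol that reacts with NEM gains about 125 Da. The sketch below is a hedged illustration of that logic and is not the analysis pipeline used in the study; the reduced mass of 14000 Da, the 1 Da tolerance and the function names are hypothetical.

```python
H_MASS_DA = 1.008     # average atomic mass of hydrogen
NEM_MASS_DA = 125.13  # average mass of N-ethylmaleimide, added whole to a free thiol


def expected_mass(reduced_mass_da, n_disulfides, n_nem_adducts=0):
    """Mass after forming disulfides (2 H lost each) and adding NEM adducts."""
    return reduced_mass_da - 2 * H_MASS_DA * n_disulfides + NEM_MASS_DA * n_nem_adducts


def infer_nem_adducts(observed_mass_da, reduced_mass_da, n_disulfides, tolerance_da=1.0):
    """Return the NEM adduct count that explains the observed mass, or None."""
    oxidized = expected_mass(reduced_mass_da, n_disulfides)
    n_adducts = round((observed_mass_da - oxidized) / NEM_MASS_DA)
    if n_adducts < 0:
        return None
    predicted = expected_mass(reduced_mass_da, n_disulfides, n_adducts)
    return n_adducts if abs(observed_mass_da - predicted) <= tolerance_da else None


# Hypothetical fully reduced mass for a monomer with 8 intra-molecular disulfides.
REDUCED_MASS_DA = 14000.0
fully_oxidized = expected_mass(REDUCED_MASS_DA, n_disulfides=8)
print(infer_nem_adducts(fully_oxidized, REDUCED_MASS_DA, 8))
print(infer_nem_adducts(fully_oxidized + NEM_MASS_DA, REDUCED_MASS_DA, 8))
```

An observed mass that matches the fully oxidized value with zero adducts corresponds to the situation described above, in which no cysteine is left free to react with NEM.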

Table 2 Molecular weight analysis by mass spectrometry for purified fibulin-2 constructs.

SEC-MALS analysis was then carried out to further investigate the oligomeric states of the S427-G545 constructs (wild-type and C500L mutant). Both the wild-type and the C500L mutant eluted at the same volume in SEC. They had apparent molecular weights of 26.3 kDa and 25.7 kDa (wild-type and C500L mutant, respectively) according to MALS, indicating a dimeric state for both proteins (supplementary Fig. S2). This is in agreement with the study by Sasaki and co-workers¹⁰ and suggests that the inter-molecular disulfide bond via C500 is not critical for dimer formation.
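The oligomeric state follows from dividing the MALS mass by the monomer mass and rounding to the nearest integer. The short sketch below uses only numbers quoted in this article; taking the monomer mass as half of the predicted ~27.6 kDa dimer, and ignoring the small mass difference introduced by the C500L substitution, are simplifying assumptions.

```python
DIMER_MASS_KDA = 27.6                 # predicted disulfide-linked homodimer (from the text)
MONOMER_MASS_KDA = DIMER_MASS_KDA / 2  # assumption: monomer is half the dimer mass

MALS_MASS_KDA = {"wild-type": 26.3, "C500L": 25.7}  # values reported by MALS


def oligomeric_state(measured_kda, monomer_kda):
    """Nearest whole number of subunits explaining the measured mass."""
    return round(measured_kda / monomer_kda)


states = {
    name: oligomeric_state(mass, MONOMER_MASS_KDA)
    for name, mass in MALS_MASS_KDA.items()
}
print(states)
```

Both ratios are close to 1.9, so both proteins are assigned two subunits.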

Thermal stability was then examined for all constructs (Fig. 3). There was no significant change in signal over the temperature range 20–90 °C for any of the constructs except G592-Q710 and K1061-P1221. This suggests the constructs are extremely thermostable, which is consistent with them having multiple disulfide bonds (Table 2). The lower thermal transition shift observed for G592-Q710 could be due to the presence of an unstructured region between the two EGF folds (supplementary Fig. S1c). The thermal stability of the K1061-P1221 construct is lower than that of the other constructs, with a single thermal transition at 60 °C. This could be due to the presence of Domain III, which comprises 70% of the construct. Domain III has only a single disulfide bond (C1110-C1116) and hence might be expected to be less thermally stable.
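One common way to read a transition temperature from a thermofluor trace is to locate the steepest rise in signal. The sketch below illustrates that idea under stated assumptions and is not the analysis used in the study: the curves are hypothetical and normalized, the signal is assumed to increase on unfolding, and the `min_amplitude` threshold for calling a trace flat is an arbitrary illustrative value.

```python
import math


def melting_temperature(temps_c, signal, min_amplitude=0.1):
    """Midpoint of the steepest rising interval, or None if the trace is flat."""
    if max(signal) - min(signal) < min_amplitude:
        return None  # no significant change in signal over the range
    steepest = max(
        range(len(temps_c) - 1),
        key=lambda i: (signal[i + 1] - signal[i]) / (temps_c[i + 1] - temps_c[i]),
    )
    return (temps_c[steepest] + temps_c[steepest + 1]) / 2


temps = list(range(20, 92, 2))  # 20 to 90 °C in 2 °C steps
# Hypothetical normalized traces: one sigmoidal transition and one flat trace.
unfolding = [1 / (1 + math.exp(-(t - 61) / 3)) for t in temps]
flat = [0.02 for _ in temps]

print(melting_temperature(temps, unfolding))
print(melting_temperature(temps, flat))
```

A flat trace returns no transition at all, which is the behaviour described above for the constructs with no significant change in signal between 20 and 90 °C.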

The secondary structure of the protein constructs was examined using far-ultraviolet circular dichroism (CD) spectroscopy. The construct S427-G545, containing only the anaphylatoxin-like modules, is predicted to be predominantly α-helical (supplementary Fig. S1b). The CD spectra of S427-G545, for both the wild-type and the C500L mutant, were consistent with the predicted structural information, showing a positive peak near 193 nm and negative peaks near 208 nm and 222 nm (Fig. 4a,b). The CD spectra of most of the fibulin-2 constructs containing two EGF-like domains (D709-V802, T800-V896, V894-V981 and E979-L1063), that is, all except G592-Q710, showed a sharp negative peak near 195 nm. This indicates that these constructs lack significant regular secondary structure components (Fig. 4c–g), which agrees with the AlphaFold predicted structures (supplementary Fig. S1c). In contrast, the CD spectra of construct K1061-P1221, which has a single EGF-like domain and Domain III, showed a positive peak near 190–195 nm and a negative peak near 210–220 nm (Fig. 4h), which suggests the presence of significant amounts of antiparallel β-pleated sheets. This agrees with the predicted structure of the C-terminal Domain III (supplementary Fig. S1d). Hence all fibulin-2 constructs exhibited CD spectra consistent with their predicted structure, which, when combined with the MS data showing that all cysteines are in disulfides, suggests that all are natively folded.

To further investigate the thermal stability of these constructs, changes in secondary structural elements were examined using CD spectroscopy. The constructs containing the anaphylatoxin-like modules (S427-G545, wild-type and C500L mutant) showed no or only very minor changes in their CD spectra (Fig. 4a,b), which indicates that they are highly thermostable. In contrast, most of the EGF-like domain-containing constructs showed apparent conformational changes, with a shift in the position of the negative peak to higher wavelengths at higher temperatures. Most of these constructs showed a single thermal transition, with a transition temperature above 60 °C (Fig. 4c–g). A more significant change in the CD spectra between room temperature and high temperature can be seen for K1061-P1221, which is consistent with the thermofluor data (Fig. 3 and Fig. 4h). When the pre-heated samples (at 90 °C) were cooled to room temperature, the CD spectra of all constructs except K1061-P1221 shifted back to their native room-temperature state (supplementary Fig. S3). This efficient “refolding” could either be due to this being a conformational change rather than denaturation (consistent with the thermofluor data) or could be due to the presence of multiple disulfide bonds allowing the denatured protein to rapidly and efficiently readopt its native state.

Structural studies

As no crystal structures were previously reported for any fibulin, we then attempted to crystallize the S427-G545 construct. We were able to obtain diffracting crystals and solved the structure in the P2₁ space group at 2.2 Å resolution. The final model included 8 protein copies (A-H) and 108 water molecules in the asymmetric unit. Pseudo-translational symmetry was detected in the crystal form, leading to relatively high R factors at the end of the refinement: 24.28% (R_work) and 29.74% (R_free) (supplementary Table S1). However, the electron density maps were well defined for most of the protein chains (supplementary Fig. S4a). The structure included four dimers (AB, CD, EF and GH) in the asymmetric unit, with each dimer having local two-fold symmetry. Pseudo-translational NCS symmetry was found between dimers AB, CD and EF, GH, respectively. The Cα traces of all 8 copies in the asymmetric unit were very similar and superimposed on each other with r.m.s.d. values of less than 1 Å. The overall structure of each chain is predominantly α-helical, which is consistent with the CD data (Fig. 4a). There are four alpha helices (T428-D445 for α1, D460-E488 for α2, L504-A520 for α3, Y533-E544 for α4) which all run in the same direction (Fig. 5). The loop structures between α1 and α2 (N446-S459) and between α2 and α3 (G499-S503) were incompletely modelled in all 8 chains, indicating the flexible nature of these regions. The complete N-terminal His-tag with the initiating methionine was visible in chains A and E. The r.m.s.d. values for Cα atoms and for all atoms were 1.7 Å and 2.9 Å, respectively, when chain A of the crystal structure was compared with the corresponding region of the AlphaFold2 (alphafold.ebi.ac.uk) model of mouse fibulin-2.
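The r.m.s.d. values quoted above measure the average displacement between equivalent atoms after superposition. As a minimal sketch, assuming that the two coordinate sets have already been superimposed and list equivalent atoms in the same order, the calculation reduces to the following. The coordinates are hypothetical, and the superposition step itself (for example a least-squares fit) is deliberately left out.

```python
import math


def rmsd(coords_a, coords_b):
    """R.m.s.d. between two equally long lists of (x, y, z) positions, in Å."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(math.dist(a, b) ** 2 for a, b in zip(coords_a, coords_b))
    return math.sqrt(squared / len(coords_a))


# Hypothetical Cα positions for two pre-superimposed chains (Å).
chain_1 = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
chain_2 = [(0.0, 0.5, 0.0), (3.8, 0.5, 0.0), (7.6, 0.5, 0.0)]

print(rmsd(chain_1, chain_2))
```

Restricting the input to Cα atoms or passing all atoms gives the two kinds of value reported for the comparison with the AlphaFold2 model.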

The crystal structure shows a compact one-domain alpha-helical structure rather than the previously suggested three separate domains, each having an anaphylatoxin-like motif⁴. The structure has a disulfide bond architecture which stabilizes the three anaphylatoxin modules (Fig. 5b and Fig. 6b). The first anaphylatoxin-like fold includes the α1-helix (including C435 and C436), the α1-α2 loop (including C449), and the N-terminal half of the long α2-helix (including C462, C469 and C470), and the fold is stabilized by three disulfides: C435-C462, C436-C469 and C449-C470. These disulfide bonds are referred to as 1-SS1, 1-SS2 and 1-SS3, respectively, from now on. The second anaphylatoxin-like fold includes the C-terminal half of the α2-helix (including C479), the α2-α3 loop (including C492) and the N-terminal half of the α3-helix (including C508 and C509). This fold is stabilized by two disulfides (C479-C508 and C492-C509, referred to as 2-SS2 and 2-SS3, respectively). The third anaphylatoxin-like fold consists of the C-terminal half of the α3-helix (including C511 and C512), the α3-α4 loop (including C525) and the complete α4-helix (including C535, C542 and C543), and is again stabilized by three disulfides, namely C511-C535 (3-SS1), C512-C542 (3-SS2) and C525-C543 (3-SS3). In total, 8 intra-molecular disulfide bonds are found in the domain. Most of them are clearly defined by the electron density (such as 1-SS3 (C436-C469) shown in supplementary Fig. S4a) and they have the same conformation in all 8 copies of the protein in the asymmetric unit.
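The disulfide architecture described above can be cross-checked against the helix boundaries given for the structure. The sketch below encodes the eight intra-molecular disulfides and the four helices as plain Python data, using only residue numbers stated in this article, and reports where each cysteine sits. The `location` helper and its labels are illustrative names, not part of any structural biology library.

```python
# Helix boundaries and disulfide pairs as stated in the text (mouse numbering).
HELICES = {
    "α1": (428, 445),
    "α2": (460, 488),
    "α3": (504, 520),
    "α4": (533, 544),
}
DISULFIDES = {
    "1-SS1": (435, 462), "1-SS2": (436, 469), "1-SS3": (449, 470),
    "2-SS2": (479, 508), "2-SS3": (492, 509),
    "3-SS1": (511, 535), "3-SS2": (512, 542), "3-SS3": (525, 543),
}
UNPAIRED_CYSTEINE = 500  # forms the inter-molecular disulfide


def location(residue):
    """Name the helix or inter-helix loop that contains a residue number."""
    names = list(HELICES)
    for name in names:
        start, end = HELICES[name]
        if start <= residue <= end:
            return f"{name}-helix"
    for left, right in zip(names, names[1:]):
        if HELICES[left][1] < residue < HELICES[right][0]:
            return f"{left}-{right} loop"
    return "outside the helical core"


for label, (first, second) in DISULFIDES.items():
    print(f"{label}: C{first} ({location(first)}) to C{second} ({location(second)})")

paired = {residue for pair in DISULFIDES.values() for residue in pair}
print(len(paired), "paired cysteines; C500 is in the", location(UNPAIRED_CYSTEINE))
```

The 16 paired cysteines plus C500 account for the 17 cysteines of the three anaphylatoxin-like modules, and every bond links a cysteine to a partner in a later helix or loop.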

Structural alignment analysis using the DALI server²⁸ did not identify any homologous structure with a significant Z-score. The solved crystal structure was compared with previously solved structures of wild-type anaphylatoxin domains, human C3a (PDB code 4HW5)²⁹ and murine C5a (PDB code 4P3A)³⁰. These structures have a similar fold, with four α-helices and three conserved disulfide bonds (Fig. 6a and Fig. S5a) which help to stabilize the protein. The two crystal structures, 4HW5 and 4P3A, also have a similar disulfide bond arrangement, with the 1st and 2nd cysteine residues located on the α2-helix, the 3rd cysteine on the α3-helix, and the cysteine residues that form disulfide bonds with them all located on the α4-helix, as shown in Fig. 6a and Fig. S5. In contrast, the fibulin-2 anaphylatoxin-like modules lack the α1-helix which is found in the anaphylatoxin domain structures (Fig. 6a). Additionally, two α-helices in fibulin-2, namely α2 and α3, are each part of two different anaphylatoxin modules (Fig. 6b). All of this suggests that in fibulin-2 the three anaphylatoxin modules are embedded in a single-domain four-helix structure, forming a dimer with covalent bonding via C500.

The inter-molecular disulfide bond between the two chains via C500 cannot be confirmed reliably with this crystal structure. Cys500 is located in the flexible α2-α3 loop, which is only partly visible in the electron density maps. In the current crystal structure, Cys500 has been modelled in chains B and F. In these chains, electron density is clear for C500-L504 of the α2-α3 loop. However, the N-terminal region of this loop, A494-T499, has only weak density in those two chains and it was not possible to reliably build this region (supplementary Fig. S4b). Nevertheless, the (weak) electron density near C500 in chain F suggests the presence of an inter-molecular disulfide with C500 of chain E (supplementary Fig. S4b). When combined with the non-reducing SDS-PAGE (Fig. 2b) and MS (Table 2) data, which both indicate that all of the protein is in a disulfide-linked homodimer, the crystal structure data imply that the inter-subunit disulfide is formed, but that it does not rigidify the flexible α2-α3 loop.

To examine potential dimerization of the full-length protein, we calculated dimeric models of the complete fibulin-2 molecule using AlphaFold multimer³¹. This modelling resulted in five different proposed dimeric structures. The one with the highest probability score is shown in supplementary Fig. S6. Interestingly, all five models show a common mode of dimerization, with the anaphylatoxin-like modules at the interface between monomers. Furthermore, the dimer formed by the anaphylatoxin-like region is very similar to the crystal structure presented in this study. This further suggests that the anaphylatoxin-like module region is a key player in the dimerization of the fibulin-2 protein. Interestingly, none of the models showed head-to-tail dimerization as predicted previously¹⁰.

Methods

Cloning, expression, and purification

Bioinformatics tools were used to define construct boundaries and design plasmids, and plasmids were constructed, as previously described²⁵. Briefly, all the plasmids used in this study (shown in Table 1) have an N-terminal hexa-histidine (MH₆M-) tag preceding the first amino acid of the protein sequence. To confirm that the disulfide bond was not essential for dimer formation, we made the cysteine 500 to leucine (C500L) mutation in the S427-G545 construct. The mutant was constructed using the QuikChange site-directed mutagenesis kit (Agilent) according to the manufacturer’s instructions. All genes were fully sequenced prior to expression.

The plasmid containing the gene of interest (Table 1) and the CyDisCo plasmid (pMJS205), containing Erv1p and PDI³², were co-transformed into the E. coli strain BL21 (DE3) (Stratagene) and grown in terrific broth autoinduction media (Formedium) at 30 °C prior to induction and at 15 °C during the induction phase. Bacterial cell pellets from 1 L cultures were re-suspended in 400 mL of lysis buffer (20 mM Tris–HCl pH 8, 150 mM NaCl, 2 mM CaCl₂, 15 mM imidazole and 20 µg/mL DNase). The cells were lysed by sonication for a total duration of 90 s with 5 s pulse on and 25 s pulse off at 40% amplitude. The lysate was centrifuged at 30,000 × g, 4 °C for 40 min, and the supernatant was collected and filtered through a 0.45 µm filter.

The first step of purification used a HiTrap™ 5 mL chelating HP column (GE Healthcare) for nickel immobilized metal affinity chromatography (IMAC), where 1 column volume (CV) is equal to 5 mL. The column was first washed with 5 CV of water followed by 1 CV of nickel chloride. Excess unbound nickel was removed by washing the column with 5 CV of water. The column was then equilibrated with 5 CV of equilibration buffer containing 20 mM Tris–HCl pH 8, 150 mM NaCl, 2 mM CaCl₂. The soluble filtered supernatant fraction was then loaded onto the column, followed by 3 CV of equilibration buffer. The column was washed using 10 CV of 20 mM Tris–HCl pH 8, 150 mM NaCl, 2 mM CaCl₂, 50 mM imidazole. The protein of interest was eluted by applying a linear gradient of 20 mM Tris–HCl pH 8, 150 mM NaCl, 2 mM CaCl₂, 300 mM imidazole over 10 CV. The IMAC elution fractions containing the target protein were pooled and concentrated. Approximately 1 mL of the protein was then injected into the HiLoad™ 16/600 Superdex™ 75 pg column (GE Healthcare) for size exclusion chromatography (SEC) purification. The column was pre-equilibrated with 20 mM Tris–HCl pH 8, 150 mM NaCl, 2 mM CaCl₂. Calcium chloride was absent during the purification of the S427-G545 constructs.
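Because every IMAC step is specified in column volumes, the buffer volumes follow directly from the 5 mL column size. The sketch below tabulates them and, as a separate illustration, evaluates the imidazole concentration along a linear gradient. The starting concentration of the gradient is not stated above, so the 50 mM default (the wash buffer concentration) is an assumption, as are the step labels and function names.

```python
CV_ML = 5.0  # 1 column volume for the 5 mL HiTrap column

# (step, column volumes) in the order given in the text. Sample loading is
# omitted because its volume depends on the lysate.
IMAC_STEPS = [
    ("water wash", 5),
    ("nickel chloride charge", 1),
    ("water wash to remove unbound nickel", 5),
    ("equilibration buffer", 5),
    ("equilibration buffer after loading", 3),
    ("50 mM imidazole wash", 10),
    ("imidazole gradient elution", 10),
]


def step_volumes_ml(steps, cv_ml=CV_ML):
    """Convert column volumes to millilitres for each step."""
    return [(name, n_cv * cv_ml) for name, n_cv in steps]


def gradient_imidazole_mm(cv_elapsed, start_mm=50.0, end_mm=300.0, length_cv=10.0):
    """Imidazole concentration during a linear gradient.

    start_mm is an ASSUMPTION: the text gives only the 300 mM end point.
    """
    fraction = min(max(cv_elapsed / length_cv, 0.0), 1.0)
    return start_mm + fraction * (end_mm - start_mm)


for name, volume in step_volumes_ml(IMAC_STEPS):
    print(f"{name}: {volume:.0f} mL")
total_ml = sum(volume for _name, volume in step_volumes_ml(IMAC_STEPS))
print(f"total buffer excluding sample: {total_ml:.0f} mL")
print(f"imidazole at gradient midpoint: {gradient_imidazole_mm(5):.0f} mM")
```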

Biophysical characterization

The characterization of purified protein samples by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), mass spectrometry (MS), far-ultraviolet circular dichroism (CD) and thermofluor assay was carried out as described previously²⁵. The MS analysis shown here was done using the NEM-treated samples, which gave the same results as native samples. Additionally, an addition of 57 Da was observed for all constructs, which is most likely due to the presence of bound nickel from the IMAC purification. Size exclusion chromatography coupled with multi-angle light scattering (SEC-MALS) was performed to determine the oligomeric state of the wild-type and mutant S427-G545 constructs. The analysis was carried out using a Wyatt Mini DAWN instrument connected to a Shimadzu HPLC system. The SEC-purified protein sample (50 µL of ~2 mg/mL) was injected into a Superdex 200 Increase 10/300 column pre-equilibrated with 20 mM Tris–HCl pH 8, 150 mM NaCl. The flow rate was kept at 0.5 mL/min and the MALS data were analyzed using ASTRA software version 7.0 (Wyatt Technologies).

Structural characterization

Structural studies of the pAS79 construct (MH₆M-S427-G545), at a protein concentration of 10 mg/mL, were carried out using X-ray crystallography. Initial crystallization screening was set up on 96-well triple sitting drop plates (SPT Labtech) using a Mosquito LCP nanodispenser (SPT Labtech). The crystallization screens used included Oulu Factorial (in-house screen) and JCSG plus (Molecular Dimensions). The volume of reservoir solution (RS) used was 50 µL and the crystallization drops were set up at protein:RS ratios of 1:2, 1:1 and 2:1. The crystallization plates were kept at room temperature (22 °C) and a Formulatrix Rock Imager RI54 was used for plate imaging. Crystallization drops were viewed using IceBear software³³.

An initial crystallization hit was found using 0.2 M potassium nitrate, 20% w/v PEG 3350. The condition was further optimized by using protein concentrations ranging from 2.5 to 10.0 mg/mL and varying the PEG 3350 concentration from 2 to 25%. A single crystal appeared in a crystallization drop with a 1:2 (protein:RS) ratio, with a reservoir of 0.2 M potassium nitrate, 2% w/v PEG 3350 and a protein stock concentration of 10 mg/mL. The protein crystal was mounted in a cryoloop (Hampton) with the addition of 25% glycerol in the reservoir solution prior to flash freezing in liquid nitrogen. The X-ray diffraction data from the frozen crystal were collected to 1.97 Å resolution at Diamond Light Source (DLS, Didcot, United Kingdom) beamline I-04. The auto-processed data (48.40–1.97 Å) from the Xia2 (3dii) pipeline were used for structure solution³⁴. The data collection parameters and structure solution parameters are shown in supplementary Table S1.
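The starting composition of a sitting drop follows from simple dilution of the protein stock and the reservoir solution at the chosen mixing ratio. The sketch below works this through for the 1:2 (protein:RS) drop that gave the crystal. It is a simplified illustration that ignores the components of the protein buffer and the subsequent concentration of the drop by vapour diffusion; the function name and dictionary labels are hypothetical.

```python
def drop_concentrations(protein_parts, reservoir_parts, protein_stock_mg_ml, reservoir):
    """Initial concentrations in a drop mixed from protein stock and reservoir."""
    total = protein_parts + reservoir_parts
    mixed = {"protein (mg/mL)": protein_stock_mg_ml * protein_parts / total}
    for component, concentration in reservoir.items():
        mixed[component] = concentration * reservoir_parts / total
    return mixed


# Reservoir and stock values for the drop that produced the single crystal.
RESERVOIR = {"potassium nitrate (M)": 0.2, "PEG 3350 (% w/v)": 2.0}
drop = drop_concentrations(1, 2, 10.0, RESERVOIR)

for component, value in drop.items():
    print(f"{component}: {value:.3f}")
```

Changing the first two arguments to 1, 1 or 2, 1 gives the corresponding starting compositions for the other two drop ratios used in screening.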

The anaphylatoxin-like region (S427-G545) of the AlphaFold predicted model of the complete fibulin-2 from mouse (accession code: AF-P37889-F1, https://alphafold.ebi.ac.uk)^26,27 was used as a search model in the molecular replacement calculations by Phaser of the Phenix suite^35,36. Initial refinement steps were done with Phenix.refine³⁷ and iterative model building with COOT³⁸, using the data up to 2.2 Å resolution. The diffraction data in the resolution range 2.19–1.97 Å were not used for the refinement as the reflections were very weak (I/sigma below 1.8, R_merge and R_pim more than 100% and 70%, respectively) and their inclusion did not bring any improvements to the electron density maps. The last refinement was done with Refmac³⁹ in the CCP4 Cloud⁴⁰. The final coordinates were validated with MolProbity⁴¹. Images of the structures were prepared using Pymol software⁴².
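The decision to truncate the data can be expressed as a scan over resolution shells that stops at the first shell whose mean I/sigma falls below a threshold. The sketch below illustrates that logic only. The 1.8 threshold and the shell limits 48.40, 2.19 and 1.97 Å come from the text, whereas the intermediate shell limits and all I/sigma values are hypothetical, and the real decision also took R_merge, R_pim and map quality into account.

```python
I_SIGMA_CUTOFF = 1.8  # threshold quoted in the text

# (low-resolution limit in Å, high-resolution limit in Å, mean I/sigma).
# Intermediate limits and all I/sigma values are HYPOTHETICAL.
SHELLS = [
    (48.40, 4.00, 25.0),
    (4.00, 3.00, 14.0),
    (3.00, 2.50, 6.5),
    (2.50, 2.19, 2.4),
    (2.19, 1.97, 0.9),
]


def resolution_cutoff(shells, min_i_sigma):
    """High-resolution limit of the last shell kept, scanning from low resolution.

    Scanning stops at the first shell whose mean I/sigma is below min_i_sigma.
    Returns None if even the lowest-resolution shell fails.
    """
    cutoff = None
    for _low, high, i_sigma in shells:
        if i_sigma < min_i_sigma:
            break
        cutoff = high
    return cutoff


print(resolution_cutoff(SHELLS, I_SIGMA_CUTOFF))
```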

Modelling calculations

The dimeric models for the complete full-length mouse fibulin-2 were calculated by AlphaFold-multimer³¹ in the COSMIC² cloud platform for structural biology research and education⁴³.

Data availability

The data presented in this study are contained within the article and supplementary material. The structure is available in the Protein Data Bank (PDB code: 8R5W).

References

Mahajan, D. et al. Role of fibulins in embryonic stage development and their involvement in various diseases. Biomolecules https://doi.org/10.3390/biom11050685 (2021).
Argraves, W. S., Dickerson, K., Burgess, W. H. & Ruoslahti, E. Fibulin, a novel protein that interacts with the fibronectin receptor β subunit cytoplasmic domain. Cell 58(4), 623–629. https://doi.org/10.1016/0092-8674(89)90097-4 (1989).
Argraves, W. S., Tran, H., Burgess, W. H. & Dickerson, K. Fibulin is an extracellular matrix and plasma glycoprotein with repeated domain structure. J. Cell Biol. 111(6), 3155–3164. https://doi.org/10.1083/jcb.111.6.3155 (1990).
Pan, T. C. et al. Structure and expression of fibulin-2, a novel extracellular matrix protein with multiple EGF-like repeats and consensus motifs for calcium binding. J. Cell Biol. 123(5), 1269–1277. https://doi.org/10.1083/jcb.123.5.1269 (1993).
Zhang, R. Z. et al. Fibulin-2 (FBLN2): Human cDNA sequence, mRNA expression, and mapping of the gene on human and mouse chromosomes. Genomics 22(2), 425–430. https://doi.org/10.1006/geno.1994.1404 (1994).
Sasaki, T., Mann, K., Murphy, G., Chu, M. L. & Timpl, R. Different susceptibilities of fibulin-1 and fibulin-2 to cleavage by matrix metalloproteinases and other tissue proteases. Eur. J. Biochem. 240(2), 427–434. https://doi.org/10.1111/j.1432-1033.1996.0427h.x (1996).
Timpl, R., Sasaki, T., Kostka, G. & Chu, M. L. Fibulins: A versatile family of extracellular matrix proteins. Nat. Rev. Mol. Cell Biol. https://doi.org/10.1038/nrm1130 (2003).
Argraves, W. S., Greene, L. M., Cooley, M. A. & Gallagher, W. M. Fibulins: Physiological and disease perspectives. EMBO Rep. 4(12), 1127–1131. https://doi.org/10.1038/sj.embor.7400033 (2003).
Kobayashi, N. et al. A comparative analysis of the fibulin protein family: Biochemical characterization, binding interactions, and tissue localization. J. Biol. Chem. 282(16), 11805–11816. https://doi.org/10.1074/jbc.M611029200 (2007).
Sasaki, T. et al. Dimer model for the microfibrillar protein fibulin-2 and identification of the connecting disulfide bridge. EMBO J. 16(11), 3035–3043. https://doi.org/10.1093/emboj/16.11.3035 (1997).
Downing, A. K. et al. Solution structure of a pair of calcium-binding epidermal growth factor-like domains: Implications for the Marfan syndrome and other genetic disorders. Cell 85(4), 597–605. https://doi.org/10.1016/s0092-8674(00)81259-3 (1996).
Sasaki, T. et al. Structural characterization of two variants of fibulin-1 that differ in nidogen affinity. J. Mol. Biol. 245(3), 241–250. https://doi.org/10.1006/jmbi.1994.0020 (1995).
Giltay, R., Timpl, R. & Kostka, G. Sequence, recombinant expression and tissue localization of two novel extracellular matrix proteins, fibulin-3 and fibulin-4. Matrix Biol. 18(5), 469–480. https://doi.org/10.1016/S0945-053X(99)00038-4 (1999).
Olin, A. I. et al. The proteoglycans aggrecan and versican form networks with fibulin-2 through their lectin domain binding. J. Biol. Chem. 276(2), 1253–1261. https://doi.org/10.1074/jbc.M006783200 (2001).
Reinhardt, D. P. et al. Fibrillin-1 and fibulin-2 interact and are colocalized in some tissues. J. Biol. Chem. 271(32), 19489–19496. https://doi.org/10.1074/jbc.271.32.19489 (1996).
Utani, A., Nomizu, M. & Yamada, Y. Fibulin-2 binds to the short arms of laminin-5 and laminin-1 via conserved amino acid sequences. J. Biol. Chem. 272(5), 2814–2820. https://doi.org/10.1074/jbc.272.5.2814 (1997).
Sasaki, T. et al. Short arm region of laminin-5 γ2 chain: Structure, mechanism of processing and binding to heparin and proteins. J. Mol. Biol. 314(4), 751–763. https://doi.org/10.1006/jmbi.2001.5176 (2001).
Ries, A., Göhring, W., Fox, J. W., Timpl, R. & Sasaki, T. Recombinant domains of mouse nidogen-1 and their binding to basement membrane proteins and monoclonal antibodies. Eur. J. Biochem. 268(19), 5119–5128. https://doi.org/10.1046/j.0014-2956.2001.02437.x (2001).
Hopf, M., Göhring, W., Mann, K. & Timpl, R. Mapping of binding sites for nidogens, fibulin-2, fibronectin and heparin to different IG modules of perlecan. J. Mol. Biol. 311(3), 529–541. https://doi.org/10.1006/jmbi.2001.4878 (2001).
Sasaki, T. et al. Tropoelastin binding to fibulins, nidogen-2 and other extracellular matrix proteins. FEBS Lett. 460(2), 280–284. https://doi.org/10.1016/S0014-5793(99)01362-9 (1999).
Pérez-Rico, C. et al. Tropoelastin and fibulin overexpression in the subepithelial connective tissue of human pterygium. Am. J. Ophthalmol. 151(1), 44–52. https://doi.org/10.1016/j.ajo.2010.07.012 (2011).
Debeer, P. et al. The fibulin-1 gene (FBLN1) is disrupted in a t(12;22) associated with a complex type of synpolydactyly. J. Med. Genet. 39(2), 98–104. https://doi.org/10.1136/jmg.39.2.98 (2002).
Weigell-Weber, M. et al. Genomewide homozygosity mapping and molecular analysis of a candidate gene located on 22q13 (fibulin-1) in a previously undescribed vitreoretinal dystrophy. Arch. Ophthalmol. 121(8), 1184–1188. https://doi.org/10.1001/archopht.121.8.1184 (2003).
Zhang, H., Hui, D. & Fu, X. Roles of fibulin-2 in carcinogenesis. Med. Sci. Monit. https://doi.org/10.12659/MSM.918099 (2020).
Sohail, A. A., Gaikwad, M., Khadka, P., Saaranen, M. J. & Ruddock, L. W. Production of extracellular matrix proteins in the cytoplasm of E. coli: making giants in tiny factories. Int. J. Mol. Sci. https://doi.org/10.3390/ijms21030688 (2020).
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596(7873), 583–589. https://doi.org/10.1038/s41586-021-03819-2 (2021).
Varadi, M. et al. AlphaFold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 50(D1), D439–D444. https://doi.org/10.1093/nar/gkab1061 (2022).
Holm, L. & Laakso, L. M. Dali server update. Nucleic Acids Res. 44(W1), W351–W355. https://doi.org/10.1093/nar/gkw357 (2016).
Bajic, G., Yatime, L., Klos, A. & Andersen, G. R. Human C3a and C3a desArg anaphylatoxins have conserved structures, in contrast to C5a and C5a desArg. Protein Sci. 22(2), 204–212. https://doi.org/10.1002/pro.2200 (2013).
Schatz-Jakobsen, J. A. et al. Structural and functional characterization of human and murine C5a anaphylatoxins. Acta Crystallogr. D Biol. Crystallogr. 70(6), 1704–1717. https://doi.org/10.1107/S139900471400844X (2014).
Evans, R. et al. Protein complex prediction with AlphaFold-Multimer. BioRxiv https://doi.org/10.1101/2021.10.04.463034 (2021).
Gąciarz, A. et al. Systematic screening of soluble expression of antibodies and antibody fragments in the cytoplasm of E. coli. Microb. Cell Fact. https://doi.org/10.1186/s12934-016-0419-5 (2016).
Daniel, E. et al. IceBear: An intuitive and versatile web application for research-data tracking from crystallization experiment to PDB deposition. Acta Crystallogr. D Struct. Biol. https://doi.org/10.1107/S2059798320015223 (2021).
Winter, G., Lobley, C. M. & Prince, S. M. Decision making in xia2. Acta Crystallogr. D Struct. Biol. 69(7), 1260–1273. https://doi.org/10.1107/S0907444913015308 (2013).
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40(4), 658–674. https://doi.org/10.1107/S0021889807021206 (2007).
Adams, P. D. et al. Phenix: a comprehensive python-based system for macromolecular structure solution. Acta Crystallogr. D Struct. Biol. 66(2), 213–221. https://doi.org/10.1107/S0907444909052925 (2010).
Afonine, P. V. et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr. D Struct. Biol. https://doi.org/10.1107/S0907444912001308 (2012).
Emsley, P. et al. Features and development of Coot. Acta Crystallogr. D Struct. Biol. 66(4), 486–501. https://doi.org/10.1107/S0907444910007493 (2010).
Murshudov, G. N. et al. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. D Struct. Biol. 67(4), 355–367. https://doi.org/10.1107/S0907444911001314 (2011).
Krissinel, E. et al. CCP4 Cloud for structure determination and project management in macromolecular crystallography. Acta Crystallogr. D Struct. Biol. 78(9), 1079–1089. https://doi.org/10.1107/S2059798322007987 (2022).
Chen, V. B. et al. MolProbity: All-atom structure validation for macromolecular crystallography. Acta Crystallogr. D Struct. Biol. 66(1), 12–21. https://doi.org/10.1107/S0907444909042073 (2010).
DeLano, W. L. Pymol: An open-source molecular graphics tool. CCP4 Newsl. Protein Crystallogr. 40(1), 82–92 (2002).
Cianfrocco, M. A., Wong-Barnum, M., Youn, C., Wagner, R. & Leschziner, A. COSMIC2: A science gateway for cryo-electron microscopy structure determination. PEARC https://doi.org/10.1145/3093338.3093390 (2017).
Wilkins, M. R. et al. Protein identification and analysis tools in the ExPASy server. Methods Mol. Biol. https://doi.org/10.1385/1-59259-584-7:531 (1999).

Acknowledgements

This research was funded by the Academy of Finland, grant number 272573, and Biocenter Oulu (BCO). The use of the facilities and expertise of the BCO Structural Biology, BCO Sequencing Center and BCO Molecular Biophysics core facilities, members of Biocenter Finland, Instruct-ERIC Centre Finland and FINStruct, is gratefully acknowledged. We also thank the Proteomics and Protein Analysis core facility of the BCO for expert support. The authors would like to thank Diamond Light Source for beamtime (proposal mx26794-28), and the staff of beamline I-04 for assistance with crystal testing and data collection.

Author information

Authors and Affiliations

Faculty of Biochemistry and Molecular Medicine, University of Oulu, 90220, Oulu, Finland
Anil A. Sohail, M. Kristian Koski & Lloyd W. Ruddock
Biocenter Oulu, University of Oulu, 90220, Oulu, Finland
M. Kristian Koski & Lloyd W. Ruddock

Authors

Anil A. Sohail
M. Kristian Koski
Lloyd W. Ruddock

Contributions

Conceptualization, LWR; methodology, AAS and MKK; validation, AAS and MKK; formal analysis, AAS and MKK; investigation, AAS; data curation, AAS; writing (original draft preparation), AAS; writing (review and editing), AAS, MKK and LWR; visualization, AAS; supervision, LWR; project administration, LWR; funding acquisition, LWR. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Lloyd W. Ruddock.

Ethics declarations

Competing interests

A patent for the production system used to make the protein for structural studies using sulfhydryl oxidases in the cytoplasm of E. coli is held by the University of Oulu: Method for producing natively folded proteins in a prokaryotic host (Patent number 9238817; date of patent January 19th, 2016). Inventor: Lloyd Ruddock. The other authors have no conflicts of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Sohail, A.A., Koski, M.K. & Ruddock, L.W. Biophysical and structural studies of fibulin-2. Sci Rep 14, 15091 (2024). https://doi.org/10.1038/s41598-024-64931-7

Received: 21 February 2024
Accepted: 14 June 2024
Published: 02 July 2024
Version of record: 02 July 2024
DOI: https://doi.org/10.1038/s41598-024-64931-7