




版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡(jiǎn)介
IntroductiontoDatabases
DanielaPuiuApplicationsSpecialistCenterfortheStudyofBiologicalComplexity,VCUdpuiu@804-827-0952qq群發(fā)GeneralConcepts DatabasedefinitionOrganizedcollectionoflogicallyrelateddataDataKnownfactsTypes:text,graphics,images,sound,videosDatabasemanagementsystem(DBMS)SoftwarepackagefordefiningandmanagingadatabaseDatabaseExamples ClassrosterHospitalpatientsLiterature(publishedarticlesinacertainfield)GenomicinformationProteinstructureTaxonomySinglenucleotidepolymorphismExample:MicrobialDatabase Organism:NameAccessionnumberGenomesizeGC%ReleasedateGenomecenterSequenceGene(proteincodingregions):NameAccessionnumberOrganismLocationonthechromosome(start,end)StrandSizeProductSequenceDataabouttheproteincodingregionsinthemicrobialgenomessequencedsofar.DatabaseModelsFlatfiles ‘60Hierarchical ‘60Network ‘70Relational ‘80Objectoriented ‘90Objectrelational ‘90Webenabled ‘90DatabaseTypes(cont.)TypeTypicalnumberofusersTypicalarchitectureTypicalsizePersonal1Desktop/Laptop/PDAMBWorkgroup5-25Client/server:2tierMB-GBDepartment25-100Client/server:3tierGBEnterprise>100Client/server:distributedGB-TBInternet>1000Websever&applicationservers MB-GBFlatFilesCharacteristics:DataisstoredasrecordsinregularfilesRecordsusuallyhaveasimplestructureandfixednumberoffieldsForfastaccessmaysupportindexingoffieldsintherecordsNomechanismsforrelatingdatabetweenfilesOneneedsspecialprogramsinordertoaccessandmanipulatethedataFlatFilesExampleMicrobialdatabase:Genbankformat:EscherichiacoliK12StreptococcuspneumoniaeR6…Fastaformat:multiplefilesEscherichiacoliK12:genome,genes,genepositionsStreptococcuspneumoniaeR6:genome,genes,genepositions…Datamanipulation:Sequenceextraction,searchIndexingFormatconversion…RelationalDatabaseCharacteristics:Dataisorganizedintotables:rows&columnsEachrowrepresentsaninstanceofanentityEachcolumnrepresentsanattributeofanentityMetadatadescribeseachtablecolumnRelationshipsbetweenentitiesarerepresentedbyvaluesstoredinthecolumnsofthecorrespondingtables(keys)AccessiblethroughStandardQueryLanguage(SQL)Enterprisedatamodel GraphicalrepresentationofthehighlevelentitiesExample:MicrobialdatabaseeachorganismhasmultiplecorrespondinggenesOne:ManyrelationOrganismGene1mMetadataDatathatdescribesthepropertiesorcharacteristicsofotherdataDoesnotincludesampledataAllowsdatabasedesignersanduserstounderstandthemeaningofthedataMetadata&DataTableNameTypeMaxLengthDescriptionNameAlphanumeric100OrganismnameSizeInteger10Genomelength(bases)GcFloat5PercentGCAccessionAlphanumeric10AccessionnumberReleaseDate8ReleasedateCenterAlphanumeric100GenomecenternameSequenceAlphanumericVariableSequenceOrganismNameSizeGcAccessionReleaseCenterSequenceEscherichiacoliK124,640,00050NC_00091309/05/1997Univ.WisconsinAGCTTTTCATT…StreptococcuspneumoniaeR62,040,00040NC_00309809/07/2001EliLillyandCompanyTTGAAAGAAAA……Metadata&DataTable(cont.)NameTypeMaxLengthDescriptionNameAlphanumeric100GenenameAccessionAlphanumeric10GeneaccessionnumberOAccesionAlphanumeric10OrganismaccessionnumberStartInteger10GenestartEndInteger10GeneendStrandCharacter1GenestrandProductAlphanumeric1000GeneannotationSequenceAlphanumericVariableGenesequenceGeneNameAccessionOAccessionStartEndStrandProductSequencethrL16127995NC_000913190255+theoperonleaderpeptideMKRI…thrA16127996NC_0009133372799+homoserinedehydrogenaseIMRVL…transposase_A15902058NC_0030982020720554+transposaseMWYN…Relationships UsedtoconnecttablesField(s)thathavethesamevalueintherelatedtablesOrganism.Accession=Gene.OAccessionOrganism.AccessionUniquePrimarykeyGene.OAccessionNotuniqueSecondarykeySQLANSI(AmericanNationalStandardsInstitute)standardcomputerlanguageforaccessingandmanipulatingdatabasesystems.SQLstatementsareusedtoretrieveandupdatedatainadatabase.Includes:DataManipulationLanguage(DML)DataDefinitionLanguage(DDL)DataManipulationLanguageSyntaxforexecutingqueries,updating,inserting,anddeletingrecords.SELECT-extractsdatafromoneormoretableINSERTINTO-insertsnewdataintoatableUPDATE-updatesdatainatableDELETEFROM-deletesdatafromatableDMLExampleSelectallEscherichiacoliK12geneswhichareinthe1MB-2MBregionofthechromosome: SELECT* FROMOrganism,Gene WHERE Organism.Name=“EscherichiacoliK12”AND Organism.Accession=Gene.OAccessionAND Gene.Start>=1,000,000AND Gene.End<=2,000,000DMLExample(cont.)INSERTINTOGene(Name,Accession,OAccession,Start,End,Strand,Sequence)VALUES(“thrL”,16127995,”NC_000913”,190,255,’+’,”throperonleaderpeptide”,“MKRI…”)UPDATEGeneSETStart=160WHEREAccession=”NC_000913”DELETEFROMGeneWHEREAccession=”NC_000913”DataDefinitionLanguageSyntaxforcreating,editing,deleting:DatabasesTablesViewsIndexesConstraintsUsersPrivilegesDDLExamplesCREATEDATABASEMicrobial;CREATETABLEOrganism( Namevarchar(100) Sizeint(10) Gc decimal(5) Accessionvarchar(10) Releasedate(8) Centervarchar(100));ALTERTABLEOrganismADDSequencevarchar;DROPTABLEOrganism;DBMSSoftwarepackagefordefiningandmanagingadatabase.Examples:Proprietary:MSAccess,MSSQLServer,DB2,Oracle,SybaseOpensource:MySql,PostgreSQLDBMSAdvantages Program-dataindependenceMinimaldataredundancyImproveddataconsistency&quality
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。
最新文檔
- 員工月工作總結(jié)合集15篇
- 2025貴州鐵路投資集團(tuán)有限責(zé)任公司招聘35人模擬試卷及參考答案詳解一套
- 2025年臺(tái)州事業(yè)單位真題
- 2025年衢州常山縣公開招聘專職社區(qū)工作者12人模擬試卷及答案詳解(名校卷)
- 2025年西安航天基地公辦學(xué)校教職工招聘(74人)模擬試卷及一套完整答案詳解
- 政府和合作社合作運(yùn)營協(xié)議書7篇
- 2025北京中關(guān)村第三小學(xué)教育集團(tuán)招聘模擬試卷及參考答案詳解
- 2025年甘肅省臨夏州臨夏縣招聘警務(wù)輔助人員30人考前自測(cè)高頻考點(diǎn)模擬試題及答案詳解(有一套)
- 2025年煙臺(tái)萊陽市衛(wèi)生健康局所屬事業(yè)單位公開招聘工作人員(35人)考前自測(cè)高頻考點(diǎn)模擬試題及答案詳解(新)
- 2025河南濟(jì)源職業(yè)技術(shù)學(xué)院高層次人才引進(jìn)20人模擬試卷及答案詳解(新)
- 醫(yī)院清洗服務(wù)方案-清洗項(xiàng)目實(shí)施方案設(shè)計(jì)完整流程
- 心理健康考試題及答案
- 鉆探工(高級(jí))職業(yè)技能考試題(附答案)
- 鋰電池、新能源汽車火災(zāi)事故滅火救援處置
- 《公路技術(shù)狀況評(píng)定》課件-任務(wù)六:公路技術(shù)狀況指數(shù)MQI
- 油氣輸送管道穿越工程施工規(guī)范
- 酒店客房入住數(shù)據(jù)分析表格
- 中級(jí)財(cái)務(wù)會(huì)計(jì)知到課后答案智慧樹章節(jié)測(cè)試答案2025年春云南財(cái)經(jīng)大學(xué)
- 2025青海省建筑安全員B證考試題庫及答案
- 新版機(jī)動(dòng)車交通事故責(zé)任強(qiáng)制保險(xiǎn)合同
- 臨床檢驗(yàn)基礎(chǔ)知到智慧樹章節(jié)測(cè)試課后答案2024年秋上海健康醫(yī)學(xué)院
評(píng)論
0/150
提交評(píng)論